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Foreword 


In fall 2017, the collaboration “Arithmetic Geometry, Number Theory, and Compu- 
tation” was created through awards from the Simons Foundation to our institutions. 
We are grateful for the support of Jim Simons, Yuri Tschinkel, and other leaders at 
the Foundation that made this possible. 

Our common perspective is that advances in computational techniques accelerate 
research in arithmetic geometry and number theory, both as a source of data and 
examples and as an impetus for effective results. The dynamic interplay between 
experiment, theory, and computation has historically played a pivotal role in the 
development of number theory. In the eighteenth and nineteenth centuries, Euler 
and Gauss undertook extensive calculations by hand in the pursuit of data to help 
formulate and refine conjectures and to produce a source of counterexamples. In the 
twentieth century, systematic computations of elliptic curves and their L-functions 
led to the formulation of the Sato—Tate and modularity conjectures, both of which 
have now been proved, and the conjecture of Birch and Swinnerton-Dyer, which 
remains open but has been proved in some special cases. 

In the twenty-first century, the frontier of research in arithmetic geometry has 
moved on to curves of higher genus, abelian varieties, and K3 surfaces. Although 
available computational resources have grown dramatically, the development and 
implementation of practical algorithms have lagged behind the theory; we seek 
to correct this imbalance. In contrast to the situation with elliptic curves, in 
higher dimensions, brute force computation yields very little. To obtain practical 
algorithms, one must exploit the theoretical infrastructure of modern arithmetic 
geometry. 

We pursue these goals by recruiting graduate students and research scientists 
with expertise in both computation and fundamental mathematics. Members of 
our group prove theorems, develop algorithms, explore guiding examples, collect 
data on arithmetic objects, and support scholarly resources like the L-functions and 
Modular Forms Database (LMFDB) http://www.lmfdb.org. 

This volume offers a sampling of our work over the last three years. In addition 
to the six of us, many authors were directly supported by the collaboration: Eran 
Assaf, Alex Best, Raymond van Bommel, Edgar Costa, Alex Cowan, Lassina 
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Dembélé, Maarten Derickx, Sachi Hashimoto, Dohyeong Kim, David Lowry-Duda, 
Benjamin Matschke, David Roe, and Ciaran Schembri. We are also grateful for 
the contributions of senior affiliated scientists Andrew Booker, John Cremona, and 
Kiran Kedlaya. 

A number of manuscripts address fundamental questions informed by com- 
putation or inspired by the need for effective algorithms. Cowan’s contribution 
“Conjecture: 100% of Elliptic Surfaces over Q Have Rank Zero” builds on 
previous work of Nagao and Rosen—Silverman about quantitative measures of 
growth of ranks of fibers of elliptic surfaces. “Congruent Number Triangles with 
the Same Hypotenuse” by Lowry-Duda explores coincidences among solutions to 
the congruent number problem; motivated by observations on Dirichlet series— 
and encouraged by numerical experiments—the author uncovers some interesting 
Diophantine geometry. Poonen offers a new proof of Siegel’s 1926 theorem on 
finiteness of integral points in “The S-Integral Points on the Projective Line Minus 
Three Points via étale Covers and Skolem’s Method.” His work was inspired by, 
but is different from, M. Kim’s proof via a nonabelian analogue of the Skolem— 
Chabauty method. 

Several manuscripts focus on effective approaches to S-unit equations. “A Robust 
Implementation for Solving the S-Unit Equation and Several Applications”’—by 
Alvarado, Koutsianas, Malmskog, Rasmussen, Vincent, and West—presents their 
new algorithms for these problems. Best and Matschke use a reduction to Mordell 
equations to enumerate “Elliptic Curves with Good Reduction Outside of the First 
Six Primes.” 

The development of databases of low genus curves has demanded improved 
techniques for computing their invariants. “Efficient Computation of BSD Invariants 
in Genus 2” by van Bommel discusses quantities associated with the Birch 
and Swinnerton-Dyer conjecture. Picard groups and other invariants of higher- 
dimensional varieties are also of interest. A p-adic variational approach is developed 
in “Effective Obstructions to Lifting Tate Classes from Positive Characteristic” by 
Costa and E. Sertéz. 

The work of our collaboration tends to focus attention on examples with unusual 
properties. Fontaine and Abrashkin showed there are no nonzero abelian varieties 
over Q with good reduction everywhere, and we can enumerate elliptic curves with 
just a few small primes of bad reduction. Mascot, Sijsling, and Voight offer a nice 
example in this vein in “A Prym Variety with Everywhere Good Reduction over 
Q(/61).” The manuscript “On Rational Bianchi Newforms and Abelian Surfaces 
with Quaternionic Multiplication”’—by Cremona, Dembélé, Pacetti, Schembri, and 
Voight—presents beautiful modular forms described in the title and the associated 
very special abelian surfaces. 

Modular forms are a major focus of the collaboration, given their historical 
importance for number theory and across mathematics. Assaf focuses on algorithms 
for nonstandard levels in “Computing Classical Modular Forms for Arbitrary 
Congruence Subgroups.” We offer a comprehensive survey of computations of 
modular forms by Best, Bober, Booker, Costa, Cremona, Derickx, Lowry-Duda, 
Lee, Roe, Sutherland, and Voight. D. Kim lays the groundwork for new algorithms 


Foreword Vii 


in his contribution “Linear Dependence Among Hecke Eigenvalues.” “A Database 
of Hilbert Modular Forms” by Donnelly and Voight presents what is available on 
Hilbert modular forms in the LMFDB. 

One of the most important tools for effective computation of rational points on 
curves is the Chabauty—Coleman method, the prototype for nonabelian extensions 
developed over the last decade. We are especially interested in implementing these 
techniques for large numbers of examples. Contributions in this direction include 
“Curves with Sharp Chabauty—Coleman Bound” by Gajovi¢, “Square Root Time 
Coleman Integration on Superelliptic Curves” by Best, “Computing Rational Points 
on Genus 3 Hyperelliptic Curves” by de Frutos Fernandez and Hashimoto, and 
“Chabauty—Coleman Computations on Rank | Picard Curves” by Hashimoto and 
Morrison. 

Classification of algebraic varieties over finite fields and their Weil polynomials 
is an emerging focus of number-theoretic databases. “Restrictions on Weil Poly- 
nomials of Jacobians of Hyperelliptic Curves”—by Costa, Donepudi, Fernando, 
Karemaker, Springer, and West—develops results that emerged from experiments, 
building on theorems in genus two by Howe, Nart, and Ritzenthaler. Dupuy, Ked- 
laya, Roe, and Vincent present recent progress on building comprehensive databases 
in “Isogeny Classes of Abelian Varieties over Finite Fields in the LMFDB.” 

Finally, we include two manuscripts directly documenting recent innovations to 
the LMFDB. “Zen and the Art of Database Maintenance” by Costa and Roe describe 
their efforts to modernize the internal architecture—how objects are represented in 
the system. This work was crucial to recent expansions to the LMFDB and also 
formed the foundation for the new online directory of research talks and conferences 
http://researchseminars.org supported by the collaboration. “Visualizing Modular 
Forms” by Lowry-Duda is a product of serendipity with the program J/lustrating 
Mathematics held at the Institute for Computational and Experimental Research in 
Mathematics (ICERM) in Fall 2019. The “badges” used to represent modular forms 
on the LMFDB have been updated based on these ideas. We expect these will lead 
to new insights linking modular forms to geometric and probabilistic processes. 
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Abstract Let K be a number field, and S a finite set of places in K containing 
all infinite places. We present an implementation for solving the S-unit equation 
x+y=I1x,ye On s in the computer algebra package SageMath. This paper 
outlines the mathematical basis for the implementation. We discuss and reference 
the results of extensive computations, including exponent bounds for solutions in 
many fields of small degree for small sets S. As an application, we prove an 
asymptotic version of Fermat’s Last Theorem for totally real cubic number fields 
with bounded discriminant where 2 is totally ramified. In addition, we use the 
implementation to find all solutions to some cubic Ramanujan-Nagell equations. 
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2 A. Alvarado et al. 
1 Introduction 


In 1909, Thue proved there are only finitely many integral solutions to what we 
now call the Thue equation; i.e, that for any Q-irreducible binary form F(X, Y) of 
degree at least 3, defined over the integers, there are only finitely many solutions 
(x, y) € Z? to the equation 


F(x, y)=c, 


where c is any non-zero integer [38]. Thue accomplished this by formally factoring 
F into linear terms of the form (x — ay), where a is algebraic, then bounding the 
quality of rational approximations of a in terms of the size of x and y. Thus bounds 
on integer solutions to the Thue equation arose out of the theory of approximating 
algebraic numbers by rationals. Thue’s theorem was generalized by Siegel [33]! 
and then Mahler [25]. These generalizations gave rise to a central fact of modern 
computational number theory: if K is a number field, and S a finite list of places of 
K including all infinite places, then there are only finitely many solutions (x, y) to 
the equation 


x+y=1, x, yE OR g. (1) 


Here, OF x,s 18 the unit group of the ring Ox_s of S-integers in K. We refer to (1) 
as the S-unit equation. In this paper, we describe an algorithm to determine the 
complete set of solutions to the S-unit equation for general K and S. More generally, 
for fixed a,b € Ox.5, we can see that the equation ax + by = 1 will also have only 
finitely many solutions by expanding the set S to include all primes dividing a and 
b and searching for solutions to (1). Thus it suffices to solve (1) to address the more 
general case, and we focus on (1) here (though it should be remarked that this is not 
the most efficient way to solve ax + by = 1). 

The work of Gelfond and Schneider, resolving Hilbert’s seventh problem in the 
affirmative (all irrational algebraic powers of algebraic numbers are transcendental 
once trivial cases are ignored), determined lower bounds on the absolute value 
of a Q-linear combination of two Q-linearly independent logarithms of algebraic 
numbers. Alan Baker’s 1967 theorem [1] generalized these results to the case of 
many logarithms. Baker, Wiistholz, and many others continued to improve these 
bounds. Naturally, one should ask if similar results are available over local fields, 
and indeed such results began to appear quickly. In 1968, Brumer proved the first 
analogue of Baker’s work for p-adic logarithms [8], followed by many improve- 
ments and generalizations, such as the results of Yu [44, 45, 46]. Improvements 
in both the archimedean and nonarchimedean cases continue to appear, such as in 
[20, 4, 47, 19]. 


'See also the recent translation [17] by Fuchs. 
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For any choice of K and S, On 5 is a finitely generated Z-module. Fixing a 
basis p1,..., 0: for the torsion free part, we can express any x € Ons asx = 
g- fies 1 p;' for some root of unity € € K and some a; € Z. Building on the lower 
bounds for linear combinations of logarithms, Gyéry [18] determined effectively 
computable bounds for the exponents a;. This was a great victory for computational 
number theory, as this provably restricted all solutions to (1) to a finite search space. 
Unfortunately, the demonstrated bounds were enormous and as a matter of practice, 
it was computationally infeasible to conduct an exhaustive search for solutions, even 
in the very simplest cases. Baker and Davenport devised a clever method of reducing 
the bounds in special cases in [2]. However, in [14], de Weger built on the ideas of 
Baker-Davenport to develop a powerful general method of algorithmically reducing 
the bounds to a manageable size, relying on the lattice basis reduction algorithm of 
Lenstra, Lenstra, and Lovasz [24] (henceforth referred to as the “LLL algorithm’). 
Though it is has not been proven that de Weger’s method will always reduce the 
bounds coming from the results in linear forms of logarithms, this is the rule in 
practice. In many cases, de Weger’s approach provides sufficient improvements that, 
with careful sieving (or sometimes even with only brute force), the entire search 
space can be exhausted and complete lists of solutions can be enumerated. 

Beyond the improvements provided by LLL-based reduction, many mathemati- 
cians have developed further algorithms for efficiently searching below the “LLL 
bounds” provided by de Weger’s work. Two powerful examples are reported in 
[43] and [37]. Increasingly, the theoretical improvements (assisted by technological 
improvements) have pushed ambitious and interesting computational problems 
within reach. For example, Smart determined the entire set of all genus 2 curves 
over Q with good reduction away from 2, based in part on solving (1) for a family 
of number fields unramified away from 2 [35]. 

We have written a package of Python functions for inclusion in the computer 
algebra system SageMath [31], which solves the S-unit equation (1) over any 
number field K and for any finite set S of finite places. As experienced readers may 
expect, the package is not practical when either [K : Q] or |S| is too large, although 
there is no theoretical obstruction. While this package is the independent creation 
of the authors, it is based in part on the descriptions of algorithms implemented 
by Smart [34, 35, 36]. Specifically, we follow Smart’s development in determining 
initial large bounds, including the numbering of constants, in [34], with some 
adjustments and small corrections. In reducing the bounds, we follow [36], again 
with some adjustments. The sieving step is based on ideas cited by Smart [35] as due 
to others (as noted in Sect. 6) but has been redeveloped in new notation and style. We 
include proofs of our versions of results when we made adjustments to versions in 
the literature. To the authors’ knowledge, our package is the first publicly available 
implementation for solving the S-unit equation over any field other than Q; the 
present article describes the algorithm and its implementation. The implementation 
was a highly non-trivial undertaking, involving efforts spreading over more than 
7 years on the parts of individuals and the entire team. 
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We also provide new results facilitated by our implementation. In particular, we 
first provide a discussion of and link to explicit exponent bounds for solutions of 
the S-unit equation in all cases (K, S) where K /Q is ramified only at primes above 
some subset of {2, 3} and 


[K:Q) <5, SC {po Ox :p | 6}. 


We solve the S-unit equation in the 13 totally real cubic number fields K in which 2 
is totally ramified and the absolute discriminant of K, Ax, satisfies |Ax| < 2000, 
and we use these results to verify that an asymptotic version of Fermat’s Last 
Theorem holds over these fields. Finally, we find all solutions to certain cubic 
Ramanujan-Nagell equations. 


1.1 Overview 


The organization of the paper proceeds as follows. We introduce certain notations 
in Sect. 2. In Sect. 3, we review the relevant work of Baker-Wiistholz and Yu. This 
is used in Sect. 4 to establish a “pre-LLL” exponent bound for each place in S. In 
Sect. 5, we explain the process of using LLL to reduce these exponent bounds— 
the approach is different for archimedean and nonarchimedean places. In Sect. 6, 
we describe the sieve for further constraining the final search space. We devote 
Sect. 7 to a discussion of our experimental observations, having now executed our 
algorithm in several dozen cases. We highlight a special condition (S contains only 
one finite place) under which a significant improvement in the search space can be 
obtained. Although narrow in scope, the special condition is sufficiently natural, 
and the savings sufficiently nontrivial, as to warrant its discussion. Finally, Sect. 8 
introduces two applications: an asymptotic version of Fermat’s Last Theorem over 
totally real cubic fields and a solution to a cubic variant of the Ramanujan-Nagell 
equation. 
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2 Notation 


2.1 §S-Units in Number Fields 


Throughout this paper, we let Q denote the algebraic closure of Q inside C, the 
field of complex numbers. Unless stated otherwise, we fix the following notation 
throughout: 


K a number field (assumed to be a subfield of Q), 
dx the absolute degree [K : Q] 
w the number of distinct roots of unity in K 
AK the absolute discriminant of K /Q 
Ox the ring of integers of K 
ep the ramification index of p in K/Q 
Tp the inertial degree of the prime p C @x over the rational prime 
pnZ 
r the rank of @ as a Z-module 
Sfin aset {p1,..., ps} of s finite places of K 
Soo the set {ps41,..., Pr+s+1} of all infinite places of K 
S Stin U Soo = (Pi, +++ Pres tit 
So the set of places of Q which extend to places of K in S$ 
Ox,S the ring of S-integers in K 
Ons the group of S-units in K 
t the rank of Ons as a Z-module (sot =r-+s) 
Po a root of unity generating the torsion part of On. 5 
Ply ++, Pt an ordered basis for the torsion-free part of the Z-module On. 5 
i) the ordered list [9, 01, ..-, Or] 


If f(x) € Z[x] is a monic and irreducible polynomial, we let K ¢ denote the 
number field Q(é), where & is a root of f(x). Always, log denotes the principal 
branch of the complex logarithm function, with argument in (—z, zr]. 
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2.2 Absolute Values and Completions 


Each place of K determines an associated absolute value, | - |p, which we now 
describe. 
Let | - | denote the usual absolute value on C. If p is an infinite place, choose 


Op: K — C, an embedding corresponding to p. The associated absolute value 
depends on whether p is a real or complex (meaning non-real) place of K: 


lop (a)| p is real, 
lol, = | 2 . 
Op (a) | p is complex. 


Now suppose p is a finite place. View p as a prime ideal of (x, and let p be the 
characteristic of the residue field Ox /p. Let ordy denote the ordinal function for p. 
On @% this is defined by 


ordy(B) =m ifBep™—p™*!, 


and it extends to K * in the obvious way. We let | -|,, denote the usual absolute value 
of the p-adic field Q,,. The absolute value associated to p on K is 


lol _ pf ordp (a) : 
Let Ky be the p-adic completion of K with respect to | - |p; we also use | - |, for the 
absolute value on Ky. 

We fix once and for all an algebraic closure Q p of Qy, and let C, denote the 
completion of Q,. We use | - |, to denote the natural extension of | - |p to all of Cp. 
We define ord, on Cc, to satisfy 

lelp=p or, aeC>. 
As p@x may split into several prime ideals, the absolute value | - |, on Q may have 
several inequivalent extensions to K , of which | - | p is just one; so we must take care 
when viewing Ky as a subfield of Q,. 7 

For any embedding 0: K — Q,, we obtain a subfield of Q, as the composite 
0(K)-Q,. By the Prolongation Theorem [21, $18.5], there exists a choice of 3 such 
that (Ky, | - |p) is value-isomorphic to (0 (K)Qp, | - |»). Henceforth, we always use 
this isomorphism to view K’y as a subfield of Q,,. As the isomorphism respects the 
valuations, we know ord, and ord, satisfy 


ordpy B = ey Ord, B, Be Ky. (2) 
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2.3 Height Functions 


Suppose n > 1. We let h denote the standard logarithmic Weil height on P”(K). 
This is defined as follows: for any x = (x9: +--+: X,) € P"(K), 


1 
hw) = 7 Yo log (max {lxjlp}), 
; | 


where the sum runs over all places of K. It is a consequence of the product formula 
(21, Ch. 20, pgs. 326-327] that h(x) is independent of the choice of coordinates for 
x. For anya € K, set h(a) = h((1: @)). Note that this height is absolute in the 
sense that it is not dependent on which field extension K containing the coordinates 
of x is considered. 

We introduce a modified version of this height function, used in Sect. 3. Suppose 
Q1,...,@, € K, and let K’ = Q(q,...,Q@,) © K. For any nonzero element 
B € K’, we define the function h’ by 


1 
h'(B) = i {dx - h(B), llog B|, 1}. 


, 


The definition of another height function, hy, is slightly more technical and will be 
introduced when needed in Sect. 3. 


2.4 p-Adic Logarithms 


Inside C,,, consider the open disk 
Ai :={z€ Cp: |z— 1p < I. 


On Aj, we define the p-adic logarithm by the series 


iggy (3) 


n 
n>1 
The series is convergent on A;; moreover, on A, it satisfies the identity 
log, (xy) = log, x + log, y. (4) 
ST | 
If |z|p < p ?-' we have 


ord» (log, (1 + z)) = ord, Z. (5) 
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Based on an idea due to Iwasawa, the p-adic logarithm can be extended to any 
z € Cy such that |z|, = 1; this extension continues to satisfy (4) (see [36, II.2.4]). 
2.5 Solutions to the S-Unit Equation 


We let Ax.s denote the additive Z-module (Z/wZ) x Z'. This is isomorphic to 
On s» and the list of generators p determines an isomorphism 


t 
py: Ax.s — O% 5, a= (ao, a1,..-,ar) > | [o%". 
=0 
We use the shorthand p* := ®o(a). For obvious reasons, we call the elements of 


Ax,s exponent vectors. Much of our discussion will focus on bounds for the entries 
of an exponent vector. For a € Ax,s, we use the notation |a| < B to signify 


max |a;| < B. 
O<i<t 


Within 6% 


K.s> we wish to determine 


Xxs = {te Op g:1—-te OF gh. 


Solving the S-unit equation is equivalent to determining the set Xx 5. We let Ex 
denote the corresponding subset @,! (Xx,s) of Ax,s. 


3 The Bounds of Baker-Wiistholz and Yu 


Suppose 71,72 € On gs Provide a solution to the S-unit equation, so that t; + 
T = 1. With respect to the ordered generating set p, there are unique vectors 
b; = (bi.0, ...,bj),4) € Ax,s such that 


t 
bij : 
cae =[]oe’. i= 1,2. (6) 
j=0 


The techniques of lattice reduction discussed in Sect. 5 will not produce an absolute 
bound for [i,j | on their own; they can only be used to improve a known bound. So 
in this section, we recall bounds established by Baker-Wiistholz [3] and Kunrui Yu 
[47]. An excellent treatment of the background material appears in [15]. 
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3.1 Statement of Yu’s Bound 


Let p be a finite place of K, and let p denote the rational prime below p. We let g be 
the smallest rational prime distinct from p (so g = 2 unless p = 2, in which case 
q = 3). Let Gm := exp(2mi/m). We say K satisfies Yu’s auxiliary condition if any 
of the following hold: 


1. gq =2and p/P = 1 mod 4, 
2.qg=2and%e€ K, 
3.q=3and@ € K. 


At the end of this section, we explain how the algorithm finds a bound in cases 
where K does not satisfy Yu’s auxiliary condition. 


Theorem 3.1 (Yu, [47, pg. 190]) Suppose n > 1 and p is a prime of Ox. Suppose 
K is anumber field satisfying Yu’s auxiliary condition and (9, 1, --+, Un—1 € K* 
are chosen which satisfy 


ordy 4) =0, O<j<n-l. (7) 
n—l be 
Suppose bj ¢ Zand © := [| 4,’ # 1. Finally, suppose B satisfies 
j=0 


B => max{|bol, ..-, |Dn—1|, 3}- 
Then there exist explicit constants C i and Q, given below, such that 


ordy (© — 1) < C[Qlog B. 


3.2. The Constants Q and Q’ 


We first discuss the constant Q, and the variant Q’ used in the algorithm. In 
Theorem 3.1, & is roughly a product of the logarithmic heights of the 4;. More 
precisely, decompose the set {jz ;}; into a disjoint union aU 6, where a is a maximal 
subset of {jj}; which is multiplicatively independent. Such a decomposition need 
not be unique. Because of the possible dependence among the jz;, Yu requires a 
modified height function: 


hy(H) := max {ho | . 


(The value x; is explained in the following subsection.) The constant Q, which 
depends on n, dx, p as well as the jz ;, is then defined by 
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Qin, dx, p) = |] r)- [apo 


wea web 


As shown in [47], one may choose any maximal independent set a for the 
computation of Q. If optimization of the bound is critical, one may search over 
all possible a and take the smallest possible bound. This observation is moot in our 
use, however. 


Corollary 3.2 Keeping the hypotheses of the previous theorem, suppose also that 
[M1,---, Un—1 are multiplicatively independent. Set 


n—-1 
Q!(n, dx, p) = hp(uo) | [h)). 
j=l 
Then ordy (© — 1) < CFQ’ log B. 


Proof In this case, b is unique; either b = {uo} or b = ©. In either case, Q < Q’ 
and the result follows immediately. Oo 


In the algorithm, we are always in the situation of the Corollary. Rather than 
decide the question of independence between ju and the other jz;, we just use the 
constant Q’. 


3.3 The Constant CT 


The value of Cf := Cj(n,dx,p) is dependent on n, dx, and p, as follows. Let 
u := ordg w, so that g“ is the q-part of w. Set 


n n+1 
pee egtd MM HD 
n!} 


k 


’ 


q" \ fp log p 


ka := max {log (een + Idx) ep, fp log | : 


n+2 
) - log max{dx, e}, 


Here, e denotes the base of the natural logarithm. The constants a), kK}, and co) 
are given in Tables | and 2. Finally, 


Ci (n, dx, p) := (n+ Dkokskq. (8) 
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Table 1 The constants a“) and x 


Case 


oO) 


11 


ral 
p=2 32 40 
p=3 16 20 
p>3andey >2 | 16 20 
p> 3andey = 1 oe 10 
Table 2 The constant c\) 
p<5 p>s5 
Case cD) Case co) 
p=2 160 p=1(C)andey = 1 1473 
p=3anddx = 1 537 p=1 (4) andey = 2 1502 
p=3anddx >2 759 p=3 (4), é =1,dx =1 1288 
p=Sandey = 1 1473 p=3 (4), ep =1,dx =2 1282 
p=S5andey > 2 319 p=3 (4), ep = 2 2190 


3.4 A Remark About Implementation 


For this subsection only, suppose all hypotheses in Theorem 3.1 are satisfied, except 
K does not satisfy Yu’s auxiliary condition. Set 


“| 


K(G4) g =2, 
K(G3) q =3. 


Let 5B be a prime of Ox above p. Let esp), be the ramification index of ‘PB over p. 


Because 


ordy & = ep|p 


ordsg a, ae kK”, 


we see ordy3 4; = 0 for all 7. Now Theorem 3.1 applies with K’ and ‘P in place of 


K and p, respectively. 


Corollary 3.3 Under the conditions of this subsection, 


ordy (@ — 1) < epip - Ch(n, der, PB) - Q'(n, der, P) - log B. 


Note that even if p splits as $89’ in K’, the choice of $B is irrelevant; both give 
the exact same bound in the Corollary. 
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3.5 Bound of Baker-Wiistholz 


We now give an effective version of Baker’s theorem. (Notations are as in Sects. 2.2 
and 2.3.) 


Theorem 3.4 (Baker-Wiistholz, [3, pg. 20]) Let L be a linear form in t + 1 
indeterminates, 


L(zo,..-, Zt) = bozo +--+ + by 21, b; € Z. 


Let B = max{|bo|, ..., |Dr|}, and let po, ..., Pr € Q-— {0, 1}. Let K' be the subfield 
of Q generated by the p;. If B > 3 and 


A = Lilog po, log o1,..., log p¢) # 0, 


then 


t 


log |A| > —C(t, dx’) log(B) | | h'(p;). 
j=0 


where the constant C(t, dx’) is defined by 
C(t, dxr) = 18(t + 2)'(t + 1) °F” B2dx/)“F log (2(t + 1) dx). 


Note that we may be sure A + O if the set {log p;} is linearly independent over Q. 


3.6 Obtaining the Initial Bound 


The theorems of Baker-Wiistholz and Yu both provide inequalities of the form “a 
polynomial function of B is bounded by a polynomial function of log(B),” which 
in turn guarantee an absolute bound on B. The analysis to determine such a bound 
explicitly is standard; we will use the following result of Peth6 and de Weger for 
this purpose. 

Lemma 3.5 (Pethé and de Weger [29 Lemma 2.2]) Suppose the real numbers 


a,b,h satisfya>0,h>1,b> ey and let x € R be the largest solution to the 
equation 


x =a+b(logx)". 


Then 


h 
x22 (at 4 bi log (n'b)) 
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4 Initial Exponent Bounds 


4.1 An Upper Bound at the Extremal Place 


Suppose (t1, T2) is a solution to the S-unit equation, with t; specified as in (6). 
We set B = max;,; [i,j , and assume B > max{4, w}. Relabeling t, and 7T2 if 
necessary, we assume B = |b, ;| for some 1 < j < tf. Recall that S contains 
precisely t + | places, pj, ..., P41. We choose the indices k, @ € {1,2,...,¢+ 1} 
so that 


flog Itulp,| = max log itilp|, ——|t1lpe = min I7ilp. 

Remark 4.1 In the sequel, we number our constants in an effort to stay consistent 
with the enumeration given in Smart’s paper [34]. There, Smart considers a more 
general unit equation, and so introduces certain constants c4(i), c6(i), c7(Z), ... 
whose values are trivial in the present application. So while the alert reader may 
notice gaps in the enumeration of constants, this is intentional. (Adjusting our 
implementation to the more general setting is not difficult, but we are satisfied to 
limit the discussion to match the current state of the implementation.) 


For any choice of U := {uy,..., u,} C S define the ¢ x ¢ matrix 
M = (m;,;), mij = log |pjlu;- 


One may always choose U so that M is invertible (see [15, §5.1]), and so we assume 
this is the case. We have 


bi4 log |T1lu, 
bi,2 _, | log tile 
: M : 


Dit log |T1 lu, 


Let ||M|| be the row norm of M~!, ice. ||M|| = max; viet |m;,;|, and set 
cy = max {I max {||M|| : is invertible | : 
UcS 


Note that this differs slightly from Smart’s definition, to ensure that c; > 1. Then 
B<c log Til p,|- We define 


1 0.9999999c> 
Qi — cz = 
Cl r+s 
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By Smart [34, Lemma 2], we have 
tile Se": (9) 


We now have an upper bound on |t;|p, in terms of B. We next establish a lower 
bound, also involving B, which will force a limit on the size of B. The precise 
argument depends on whether py, is a finite or infinite place. For the purposes of the 
algorithm, we must compute this bound on B for each possible index 1 < £ < f; 
we have no choice but to take the largest possible bound, i.e., the larger of the two 
values Ko and K, determined in the remainder of this section. 


4.2 Case: pg Is Finite 


If pe is finite, then let pg also denote the associated prime ideal in Ox. Let p be 
the prime of Z lying below pg, and let eg and fe denote the ramification index and 
inertial degree of py over p, respectively. From (9) we have 


Nxq(pe) “Pe < e738 (10) 
Setting 
C3 
Ch) = ee 
ec log Nx /Q(Pe) 
the inequality (10) yields 
3B 


ordy, = epcs(l)B > 0, (11) 


Oe eee 
log Nx/Q (Pe) 


and so ordy, t2 = O. We would like to apply Yu’s Theorem to © = 72, but 
unfortunately the generators po; may have nonzero order with respect to pg. So we 
now replace the o; with a different set of generators, as in [34, pp. 824-825]. First, 
set nj := ordy, p;. Necessarily, there exist indices i for which nj; 4 0. Choose ig so 
that 


\nig| = min{|n;| : ni A O}, 
and now relabel so that i9 = t. For 1 <i < t — 1, define 
ny nj 
i= Pj Pt i) 


so that ord,, 4; = 0. Next, for each i with 1 <i < t—1, choose integers d;, rj such 
that 


O<7r; < |n;| and boi = nidi +77. 
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Necessarily, |d;| < B. Set N := aa NjT;j.- 
Lemma 4.1 We have N = 0 (mod n;). 
Proof Since ordp, (129°) = 0, we know aa njb2,; = 0. Thus, 


1 t=1 t=1 


nb2,. = — Yo nibo,i =n Se nidi = ya 


i=l i=1 i=l 
proving the claim. Oo 


2,0 —No . 


Setting No = x and [lg := po Pr Tz} p;', we have arranged that 


t—1 
n=po] fui, Idi <B, ordp, uj =0. (12) 
i=l 


Since 0 < bo.9 < w and 0 < 7; < |n;|, there are only finitely many possible values 
for fo, and this finite set can be determined without any knowledge of B or the 
b,;. For each j49, we may apply Corollary 3.2 or 3.3 as appropriate, and obtain a 
constant cg (¢, (49) such that 


ord, tT) = ordy,(t2 — 1) < cg(£, [10) log B. 


Setting 


2 


cg(¢) = max | —_ max{cg(, vo))| . 
log2 #0 


we may be sure every S-unit solution satisfies 
ord, T) < cg(€) log B. (13) 
Combining inequalities (11) and (13), we have 


cg(€) 
< 
eecs(€) 


log B. 


Since cy > 1 and cg(£) > e* (log 2)~!, it follows that 
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Applying Lemma 3.5 with a = 0, b = cg(€)/eecs(€), and h = 1, we may conclude 


B < Ko(£):= 200) log (2) : 


eecs(€) e€ecs 


Set 
Ko := max{Ko(€) : pe € Sgn}. 
If £ corresponds to a finite place, then B < Ko. 
In our implementation, the functions mus and possible _mu0Os are used to 
recover the j4; for each finite place pe. The constants cg(£) determined from Yu’s 


Theorem are computed in Yu_bound, while the constant Ko, which may be of 
independent interest, is computed by KO_ func. 


4.3 Case Il: py Is Infinite 


We now assume py is infinite. As in §2.3, we let op, denote the embedding of K 
into C such that 


1 i ] 
lal, = lop (a, where a(e) =} PES 
2 pe is complex. 


We let a denote Op, (a) for any a € K, and we define 


5(£) log 4 c 
cu@:=——=—, ew = FH. 


The condition (9) can now be expressed as 


le | < 3B 
The choices of c1;(€) and c;3(¢) guarantee that 
Been) => |q°|< 
Set A := log ae The estimate | log z| < 2|z — 1| holds for |z — 1] < 1 and so 


JAL = 2|ef - 1] = 2|e{| < 22-08, (14) 


Solving S-Unit Equations 17 


The next step is to view A as a linear form in logarithms and apply the theorem of 
Baker and Wiistholz. Set ¢ := exp a € C. Since fp is a wth root of unity, there 
exists 0 < k < w such that (p?)b2.0 — be. By (6), we have 


b2,0 bo, j 

0)\72: )\?2i 

a ate (()""-T1(0!) 
j=l 


t 
= log a So bo, log pi? +A-2rV-1 
=I 
j (15) 
t 
=klog¢é+ >. bz, ; log o + Aw logg 
j=l 
t 
= (Aw+hk)logg + > bz, ; log a 
j=l 


where we have introduced A € Z to adjust for the principal branch of the logarithm. 
Certainly |A| < tB, and so |Aw+k| < (¢+ 1)Bw. Set 


He is, Awt+tk j=0 
ad | bj j>0 
and L’(zo,..., Z1) = Yj=0 by {2 j- We now have 
e e 
|A| = L' (log, log p| a, ., log pi »| : 


Taking K’ = Q(po,.--, pr) = QE, o(,..., pf), we define 
t 
c1a(0) = Ct, de) [| h'(o)). 
j=0 


(Recall that C(t, dx’) is defined in Theorem 3.4.) We have D5 |< B:=(t+1)Bw. 
Applying Theorem 3.4 to A, we obtain 


log |A| > —c14(2) - log B’ = —c14(€) log((t + 1)wB). 
Combining this inequality with (14), we obtain 


263 OB 5 |A| > ec14(e) log BY (16) 
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This yields the inequality 
B<a(l)+ d(£) log B, 


where 


cy4(€) 
c13(€) 


1 
a(t) := ma® (log2 + ci4(€)log((t+1)w)), b(€) := 


As ¢13(€) < + and cy4(@) > 323, we have a(€) > 0 and b(£) > e”. So by 


t 
Lemma 3.5, B < c15(£) (provided B > c1;(€)), where 


c15(£) := 2(a(e) + b(£) log b(£)). 
Thus, setting 


Ki (€) := max{ci1(€), cis(£)}, 
K, := max{K,(€) : p¢ is infinite}, 


we may be sure B < Kj. In our implementation, the constant K; is computed in the 
function K1_ func. 
Combining all the results of this section, we obtain the following. 


Lemma 4.2 The constant B satisfies B < max{4, w, Ko, Ki}. 


5 LLL Reduction 


In this section we explain how we can reduce the upper bound we have computed 
in Sect.4. This is necessary, because in practice the size of the initial bound is 
extremely large and cannot be used for practical computations. The idea of the 
method we will present here has its origin in de Weger’s thesis [13, 12, 14] where 
he develops a method based on multi-dimensional approximation lattices of linear 
forms of p-adic numbers to solve (among many other equations) S-unit equations” 
over Q. These ideas of de Weger have been extended by himself and others to 
apply over any number field K, and have also been used for the solution of other 
exponential Diophantine equations [39, 40, 41, 34]. 

In the reduction step we use the LLL reduction algorithm on lattices generated 
by integer matrices. So instead of the classical LLL algorithm [24], we use the 
algorithm in [12]. If Wis a lattice in R”, let “* = Y— {0}. For y € R", we define 


21t is worth mentioning the recent results of von Kénel and Matschke [42], who solve S-unit 
equations using modularity. 
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min |x|, ify 
ULZy) =" 


min ||x—y||, otherwise. 
xeL 


Computing the exact value of £(.% y) is a very challenging problem in general. 
Instead, the function minimal vector computes a lower bound using standard 
properties of a reduced basis of a lattice and the LLL algorithm (see [36, Chap- 
ter V]). As in the previous section, we follow Smart’s notation in [34]. Most of the 
material we present in this section can also be found in [36, 15]. 

We preserve the meaning of pz from Sect. 4. When py is a finite place, we let p 
denote the prime of Z lying below pz. We continue to assume B > max{4, w} in 
this section. 


5.1 Finite Places 


Suppose py is a finite place. Set 


1 


cio (€) = 1+ ot” 


and suppose that B > cj6(€). Define Az € Ky, as Az := log, T. Combining (11), 
1 
(2), and B > ci6(€), shows that ord, t; > 1. Thus |ti|p < p »-', and by (5), 


ord, Az = ord, log, T = ord, log, (1 — 7%) =ord, tT > 1. 
Let j2;, dj be as given in (12), so that we have 


t—1 
Ay = log, t = log, Ho + di log, Hi. 
i=l 


Choose @ € Ky, such that Kp, = Q,(@), and let Disc(@) denote the discriminant 
of 6. Set D,(@) = ord, Disc(@) and n = [Ky, : Q,], so thatn = ec fe. 
Expressing Az with respect to the power basis, we obtain Az; € Q, such that 
Ad = ae Ar 0%. Further, we may express 


t-l 
Ark =a0k+ > djaje, aj €Qp, OS Kk Sn-1. (17) 
j=l 
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Using an idea due to Evertse [41, p. 257], we have 


D,(@ 
ord, Az¢ > cs(£)B — wes 


Define 


c17(€) := min {ordyajx: 1 <j <t—1,0<k<n—Jj, 


Dp) 
2 ? 


cig(£) <= cy7 (6) + 


and choose 4 € Q, such that ord, A = c17(¢). 
Should there be some index k such that cj7(¢) > ordp(ao,x), then 
ord, Az ~ = ordy ao,~ < ci7(¢), and consequently 


cig (l) 
< e 
c5(£) 


For the remainder, then, we assume 
c17(€) < min {ordy ag, :O0<k <n— I}. 


By the choice of A, «j,4 := aj,~/4 18 a p-adic integer for all j, k, and we may rewrite 
(17) as 


1-1 
Ad, k Ark 


= = kok + Da dik with ord, (24) > c5(£)B — cy (€). 
j= 


For any a € Z» and a positive integer z, let a”) denote the unique integer 
between 0 and p* such that a = a (mod p%). For a positive integer u, let & 
be the lattice generated by the columns of the matrix 


1 0 0.--- 0 
0 1 0 +. 0 Zittn-1)x(tH4n-1 
(u) ae (u) u 0 € . 
K10 Ki_1.0 P 
(u) ( 
Rigi Kia O p" 


Define 
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Also set 


u+cjg(e) 


KC max {4 w, 
oe 5(0) 


cist . 


The following lemma is a restatement of [34, Lemma 5], and provides an opportu- 
nity to improve the bound on B.? 


Lemma 5.1 If ¢(Z,y) > Vt —1- Ko, then B < Ki" (). 


In the functionp adic LLL bound we have implemented the above analysis. 
In more detail, the functions log_p and embedding_to_Kp are used to compute 
the constants a;,x € Q>» up to a given precision. If this precision is M, i.e., if the 
aj, are stored as pairs of integers modulo p™, we clearly require M > u for the 
algorithm to be meaningful. However, the shift by 4 requires an additional c17(£) 
p-adic digits of precision. So the algorithm checks that M > u + c17(k); if this 
fails, then the p-adic logarithms are computed to higher precision and the process 
is repeated. 

The function log_p is based on an algorithm of Smart in [36, p. 30]. However, 
our implementation also resolves a crucial computational problem in the evaluation 
of log , that to our knowledge has not been mentioned in the literature. To understand 
the issue, we must describe carefully what “computing the logarithm” means in a 
p-adic setting. Let us view K as a subfield of Ky, and specify @1,...,@, € K,a 
Qp-basis for Ky. Suppose w € K and set B := log, a € Ky. Necessarily, there 
exist bj € Qp such that B = )?; bja;. 

As a practical matter, no algorithm can return the true value 6; it can only return 
B € K,an approximation to 6 with |B — Alp very | small. In practice however, we 
require something more specific. We want to find b; € Q such that |b; — b; ilp is 
small for each 7. When p splits in K, the original aleorithen | is not guaranteed to do 
this. 

It can happen that at some other prime p’ above p, a has a negative valuation. 
Consequently, the sum (3) used to compute 8 will not converge p’-adically, and the 
approximations b j are not guaranteed to be p-adically close to the b;. We resolve 
this problem by choosing a suitable element 7 € K with ord, 7 > 0 for all p’ | p, 
such that 7 also satisfies 


ordy (na) = 0, ord, (na) = ordy n = 0. 
Then it holds that 


log, a = log, (na) — log, n 


3Note sq in [34] has the value ¢ — 1 in our notation. 
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By evaluating the difference on the right hand side, these p’-adic divergence issues 
are avoided, and we may be sure that the individual coefficients b; approximate the 
bj p-adically. 

Inp_ adic LLL bound one prime, we attempt to find a value u such that 
Lemma 5.1 applies. If successful, we record the improved bound KEE, The 
improvement offered by Lemma 5.1 depends only on the assumption that p¢ is the 
extremal place, and that some bound Ko > c16(¢) on the exponents is known. So 
we may replace Ko by K*@© and attempt to apply Lemma 5.1 again, possibly 
improving the bound further. Because the application of LLL is very fast compared 
to the sieving step described in Sect.6, the algorithm repeats this process until 
no further improvements can be made to KE): Once each heat) has been 
optimized in this way, the function p adic LLL bound returns 


ky = max{ Ki" (¢) : pe € Stn}. 


5.2 Complex Places 


We now consider the case where pg is an infinite complex place. The reduction is 
quite analogous to the p-adic case; again the standard references are [34, 36, 15]. 
We keep the notations from Sect. 4.3. For0 < j < t we define the complex numbers 


gael et. I= 
: log a j>0. 


AS pz, is an infinite place, we have established already that the by, ; in 


t 
A = log ow = > by jKj 
a 


satisfy the bounds 
b> | <(t+ Bw, [D5 ;| < Bforl <j <t. 


We now attempt to use lattice reduction to improve the bound; the choice of lattice 
and certain constants will depend slightly on whether the «; are all purely imaginary. 
So we define 


a 1 every «; is pure imaginary, 
0 otherwise, 
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and define 
l+o 
S=@=146)K?, r=(=) (t+ w+tw)K. 
( ) 1 J 


If«ky,..., «, are not all pure imaginary, relabel «;,...,«; so that Rex, # 0. Let [-] 
denote the nearest integer function. Now define 


[CRex,] Rex; £0, 
ary i= 
nb Rex; =0. 


Let A be the (f+ 1) x (f+ 1) integer matrix 


1 0 0 0 
c= 0 1 0 0 (18) 
[C-Rex]-:-[C-Rek-1] ay 0 


[C -Imxy] --- [(C -Imx;_1] [C -Imx;] [C - an] 
(By design, the upper left t x ¢ block of A is the identity matrix in case the «; are 
all pure imaginary.) Now, let “be the lattice generated by the columns of A, and 


suppose m_¥ is a positive lower bound for €(Z 0). When pz, is a infinite non-real 
place, we define: 


1 2 
C@ 1= max {s w, log ( e ; | : 
c13(€) (m2, — S)2 —T 


Similar to [36, Lemma VI.2], we have 


Lemma 5.2 Suppose pe is a non-real infinite place. With notation as above, 
suppose C is chosen such that my > 7T24+S. Then B < ie (£). 


Proof There are two cases to consider, as o = 0 or o = 1. Ineach case, our goal is 
to establish the inequality 


Ymy—S—T <2Ce 38, (19) 


for the result follows by isolating B in the inequality (19). 
If the «; are not all pure imaginary, we define 


t 


Oo, := ob [C-Re(x,)], 


j=l 
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t 
2 := Bho [C- =] + Y by ;[C -Im(c)]. 
j=l 
Then note that 
ICA — (®1 + @2V-1)| < T. 
Therefore 
[(®) + &2/-1)| < T+ ICAI. 
We know from (16) that 2e~°3(8 > |A|, so 
|(®, + O/—1)| < T+ 2Ce7 BOF, 
Now notice that the vector 
¥ = (05 1,059,---,05,-1, P1, B2)™ 


is in the lattice Y so |y| > m_y. Further, 
t-1 

my < ly? =) (bh)? + OT + ©} 
j=l 


2 
<(t—1)K?2+ |, 4 ,/=1 
<S+(T +2Ce 638)? 
which implies (19) and the result follows. 
In case the «; are all pure imaginary, the approach is similar. Set 
20 : 
© = Bi gIC- aes Y-d5 [C - Im(«)]. 
j=l 


Similar to the other case, we have ICA - (dV —1)| < TT, and therefore 
|| < T +|CA|. Again applying (16) we obtain 


\O| < T+ 2Ce 6308, 
Now notice that the vector 


¥ = 1, Bho. Boy, OT 
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is in the lattice Y so |y| > m_y. Further, 


t 
my < ly? = 0b}? + ©? 
i=l 


tK? + ||? 


IA 


SAT £3ce OO". 


A 


Again this implies (19). oO 


5.3 Real Places 


Now suppose that p¢ is a real infinite place. Although the arguments in Sect. 5.2 
apply to pe, we can obtain a stronger improvement by analyzing this case separately. 
Replacing p; by —pj as necessary, we may assume pe > 0 for all j with 
Ls j St. 

The mere existence of a real place forces w = 2 and 0 < bog < 1. We set 
Ko := 1/—I and define the real numbers 


As the «; € R for j > 0, we may revisit (15); this time we obtain 
t 
A = log a = Yo bo, jk; 
j=0 


as no adjustments are required to accommodate the branch cut of the logarithm. Set 
Si (= 1K? T := 5(tKi + D, 


and again let “be generated by the columns of the matrix A in (18). When pz, is an 
infinite real place, we define: 


1 2C 
KUL) ‘= max fe w, F log ( : : 
c13(£) (m2, — S)2 —7T 


Lemma 5.3 Suppose that pe is a real infinite place. With notation and definitions 
as above, suppose C is chosen so that my > T2 +S. Then B < | oem (4) 
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Proof If we define 
t 
O, := S > bo, j1C ‘Kj], 2:=b20[C- x], 
j=l 


then we obtain 
ICA — 0; — &2,V—1| <T. 
Observing that the vector 
y = (b2,1, b22, +++, bar-1, ®1, PZT EL, 


and that |b2,;| < B for j > 0, |b2,9| < 1, the remainder of the proof now follows 
the logic of Lemma 5.2 exactly. oO 


5.4 Implementation 


The function minimal vector is used in the implementation to compute a 
value for mp In cx_LLL_bound, we have implemented the reduction step for 
the infinite places applying the above idea. As in the finite case, the parameter 
C is chosen inside the function and changed as necessary to meet the bound 
my > T? + S (keeping in mind, of course, that the definitions of S and T depend 
on the particular place pz). Notice that the proof of Lemmas 5.2 and 5.3 depend 
on obtaining true rounding in obtaining the coefficients of the matrix A. In our 
implementation, we increase precision until this is assured. Similar to the case where 
pg is finite, the improvement of Lemma 5.2 needs only the assumption that pg is the 
extremal place and that some bound kK on the exponents is known. So we apply 
Lemma 5.2 repeatedly until no further improvement to K ae (£) is possible. Once 
this has been done for each infinite place, we set 


‘oa — max{K11"(¢) > pe € Soo}. 


Consequently, we have the following bound which may be passed to the sieve in the 
next section. 


Lemma 5.4 Assume that for each £, a value u = u(£) or C = C(€) exists for which 
the hypotheses of one of the Lemmas 5.1, 5.2, or 5.3 are met. Then the maximum 
exponent B appearing in any solution (tT, t2) of the S-unit equation (1) satisfies 


B< KU" :— max [xt aaa F (20) 
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We use the proof as an opportunity to summarize the algorithm, up to the sieving 
step of the next section. 


Proof There is nothing to show if the solution set is empty, so let us assume 
otherwise. We know the S-unit equation has only finitely many solutions. Keeping 
the notation of (6), let (t;, T2) be a solution where B = |a;| is maximized. One of the 
places in S, say pe, is extremal. If pe is finite, then the work in Sect. 4.2 demonstrates 
B < max{4, w, Ko(£)} by applying one of the corollaries deduced from Yu’s bound. 
If pe is infinite, then the work in Sect. 4.3 demonstrates B < max{4, w, K1(£)} by 
applying the theorem of Baker-Wiistholz. This establishes an absolute bound on B. 

For each possible £, the techniques of this section attempt to replace this absolute 
bound with a smaller bound. There is no mathematical proof that the lattice 
reduction techniques will succeed, i.e., that there will exist appropriate values u 
and C for which Lemmas 5.1, 5.2, or 5.3 apply. However, when they do exist, the 
improved bound is provably correct by the same lemmas. Here, such success is 
presumed for every @, and (20) holds. oO 


In practice, if the hypotheses of Lemma 5.4 are not established, then one only has 
the weaker bounds coming from linear forms of logarithms—these are simply too 
large to allow for a provably complete search. However, the sieve described in the 
next section can still be used up to any prescribed bound Bo; it will find all solutions 
satisfying |aj| < Bo. 


6 Further Reducing the Search Space: Sieving 


The approach taken here, for sieving against primes outside of S, is based on an 
algorithm described by Smart in [34]. Smart credits Tzanakis and de Weger with 
this approach [40]; Tzanakis reports that these ideas date back to Andrew Bremner. 


6.1 Setup for the Sieve 


Recalling the notations of Sect. 2.5, we define for any m > 0, 
AK,S,m = (Z/wZ) x (Z/mZ)’'. 


This finite set will provide a useful search space for exponent vectors in a 
way we will make more precise below. There is an obvious surjective map 
Im: AK,s — AK,s,m. Despite the fact that this map is the identity (and not a 
reduction map) in the Oth coordinate, we will refer to this as the reduction modulo 
m map, and call an element a € Ax 5m an exponent vector modulo m. 

Lett € Ors: The exponent vector for t (relative to p) is ®5!(c). That is, it is 
the unique a € Ax.s such that t = p*. Given any bound B for the exponent vector 
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of at € Xx-\5, we obtain a finite subset of On 5 that contains every solution of the 
S-unit equation. Unfortunately, this is usually still too large of a search space to be 
practical (see Sect. 7), so we must sieve this finite set (or rather, the equivalent finite 
set of exponent vectors) prior to the exhaustive search. The sieve attempts to provide 
an efficient solution to the following problem: 


Problem 6.1 Find a small set Yx,5 satisfying Ex,s C Yx.s C Ax,s. 


If we can find a small enough superset Yx,s5 in a fast enough way, the S-unit 
equation solutions can then be found by brute force search over Yx_5. 

Suppose a € Ax.s. We call b € Ax,s acomplement vector for aif p*+p” = 1. If 
a complement vector exists, it must be unique; the existence of a complement vector 
is equivalent to a € Ex,s, and a pair of complement exponent vectors correspond 
to a solution of the S-unit equation. 

Suppose g € Z is a prime number. We say g avoids S if q ¢ p for all ideals 
p € S.Ifq splits completely in @x, then there are dx prime ideals above q in Ox, 
say q0,---, 4dc—1- We let F,, denote the residue field of q;. Since q is completely 
split, we of course have F,, = F for all j. 

Suppose t € @x,s, and q is a rational prime number which splits completely in 
Ox and which avoids S. The residue field vector for t (with respect to q) is 


dx-1 


tfvg(T) = (T+q0,T+q1,.--,T7 + qdg—1) € I] Fg;: 
i=0 


where t + qj € FQ, is the reduction of t modulo qj. The residue field vector 
depends on the ordering of the primes q; above g; we fix one ordering once and for 
all whenever we consider residue field vectors with respect to q. 

Notice that we have the following commutative diagram, whose horizontal rows 
are exact. 


IL Fs 
ut 


rfv, 
1 —> @%gN(1+46K,s) —> Ck aes rfvq(@x3) —— 1 
ui nm 


(Cic.g)* Po |= zl 


0 ——> {0} x ((¢—1)Z)’ ——> Ags > Axsq-1 —— 0 
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Suppose a € Ax,s,4—1. Since any two lifts a’, a” of a to Ax,s differ by a multiple 
of (q — 1), we see that ®,(a’) and ®,(a”) differ by a perfect (¢ — 1)th power, and 
so determine the same residue field vector. In other words, the dashed arrow in the 
diagram corresponds to a well-defined map Ax,s,g-1 > [|] KF; , and so the notion 
of a residue field vector for a is well-defined. With this in mind, we abuse notation 
slightly and also write rfv,a := rfvg ®p(a’), where a’ € Ax,s is any lift of a. 


Lemma 6.1 Suppose t € Xx,5 and setn = 1—t. Then 


(a) rfvgt +1fvgn = (1,1,...,1) €[]; Faq. 
(b) rfvgt € []; lee 
(c) no entry of rfvgt is 1. 


Proof Since t + n = 1, it follows that for any j, t + 7 = 1 (mod q,), verifying 
(a). As g avoids S, t ¢ qj; for every j. This proves (b). Since (b) holds for both 
and T, (c) follows from (a). oO 


Suppose a is an exponent vector modulo g — 1; ie., a € Ax,s,q—1. We call 
b € Ax,s,q—1 a (q — 1)-complement vector for a if 


rfvq(a) + rfvg(b) = (1, 1,...,1) € | [ Fy. 
i 


Existence of a (¢ —1)-complement vector is a necessary, but not sufficient, condition 
for a to lift to the exponent vector of a unit in a solution to the S-unit equation. 
Further, any particular a may have more than one (q — 1)-complement vector 
associated to it. We set 


Ex,s(q — 1) := {a € Ax,s,q-1 : ahas a (g — 1)-complement vector}. 


6.2 Execution of the Sieve 


The strategy for the sieve is to play the sets Ex,s(gq — 1) off of one another for 
multiple values of g. Choose a finite list Q of rational prime numbers 


Q = [90,%1,---+ 4-11, 
each of which splits completely in K and avoids S, and such that 
Iem(go — 1,gi—1,.--,¢@¢-1 -D = 2B+1. 
Any true solution to the S-unit equation corresponds to exponent vectors found 


in the set Ex,s, and such vectors must reduce modulo (g; — 1) to vectors in 
Ex,s(qj — 1) for each gj € Q. Conversely, given a choice aj € Ex,s(qi — 1) 
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for each 0 <i < k, there is at most one vector a € Ax,s such that 7g,-1(a) = a; 
for each i, while also satisfying |a| < B. Define zrg to be the product of the maps 


Tq;—1: 


TQ: AK,S — | [4x,s.¢-1. 


i 


Certainly we have 


Ex,s Sg ((] x.s@ - 0). 


Because lifts from []; Ex,s(gi — 1) to Ex,s are unique when they exist, 
I], £x,s(q@i — 1) provides a reasonable proxy for the search space. We seek to 
replace each Ex s(qgi — 1) with a subset Y; C Ex ,s(qgi — 1) such that we still have 


Ex.s mo (I | Y;). (21) 


i 


Suppose q;, qj are distinct primes in Q, and suppose a; € Y;, a; € Y;. We say 
a; and a; are compatible if there exists a € Ax,s such that 7,,_;(a) = a; and 

Mq,-1(a) = = aj. Notice that for any i A j, an element a € Ex s reduces modulo 
qi — 1 and qg; — 1 to produce a compatible pair of exponent vectors. 

When a; and a; are compatible, we further call the pair complement compatible 
if there exist b; € Y; and b; € Y; such that 


¢ b; is (g; — 1)-complementary to a;, 
* bj; is (gj; — 1)-complementary to a;, 
* b; and b; are compatible. 


Lemma 6.2 Suppose the sets Y¥; © Ex.s(qi — 1) satisfy condition (21). Further, 
suppose aj € Yj, and set 


Y¥;—{aj} j=i 


If there exists j # i such that Y; contains no vectors which are complement 
compatible to a;, then 


Ex,s&%o (1%. 


In other words, under the given condition, we will lose no true solutions by 
removing a; from Y;. 
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Proof Towards a contradiction, suppose a € Ex 5 satisfies 
Tlg;-1(a) = aj. 
There is a unique b € Ex, satisfying ®,(a) + ®,(b) = 1. Set 
aj =%qj)-1(a), bj =7g,-1(b), bj = 1 q,-1(b). 


Then a; and aj; are compatible by definition. But since a; and a; cannot be 
complement compatible, the vectors b; and bj; cannot be compatible. This is 
impossible, since b € Ex,s. Thus, no such a exists and the claim holds. Oo 


The algorithm based on this lemma is the following. 


Algorithm 6.2 (Sieve) Assume that K, S are fixed and a representation of On s 
has been computed. 


INPUT: Q=[40,41,.--, 4-1] 
OUTPUT: Yo, Y%,..., Ye—1 satisfying (21). 


1. Set Yi — Ex,s(qi — 1) for eachi. 
2. Loop overi € {0,1,...,k — 1}: 


(a) Loop over a; € Y;: 


i. If Y; contains no (g; — 1)-complement vector for a;, remove a; from Y; 
ii. Loop over j € {0,1,..., —l,it+1,...,k-—1}: 


* Ifthere are no a; € Y; which are complement compatible with a;, then 
remove a; from Y;. 


3. Did Step 2 remove any elements from any set Y;? 


¢ If YES, return to Step 2. 
e If NO, then STOP. 


Once the sieve has been completed, we may find all solutions to the S-unit equation 
by doing an exhaustive search over To ; ((], ¥). 


7 Experimental Observations and Computational Choices 


In developing this code and in pursuit of applications, we have computed a very 
large number of examples. Some observations and discussion may be enlightening 
to a reader who wishes to solve the S-unit equation for their own application. 

Our implementation provides the function sieve _below_bound(K, S, B), 
which returns all solutions to the S-unit equation in On ys up to a specified bound 
B (the maximum absolute value of an entry in an exponent vector). This may be 
useful in settings where an exhaustive list of solutions is not needed. For example, 
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in the field Ky with g(x) = x? — 3x +1, and Sgn = {p : p | 2}, the provable LLL- 
reduced exponent bound is 101. However, all solutions actually satisfy the exponent 
bound 5, and the command sieve _ below _bound(K, S, 5) executes in 
under 2 seconds. 


7.1 Sieving vs. Simple Exhaustion 


Once a bound has been reduced as much as possible by LLL, this search space must 
be somehow exhausted. This general problem can be solved in multiple ways. Those 
appearing in the literature can be generally described by the following three ideas: 


(1) simple (non-number theory-based) exhaustion, 
(2) sieve by reducing the problem modulo primes not in S, and 
(3) sieve by reducing the problem modulo powers of primes in S. 


Idea (1) could be looked at through the more general lens of efficient pro- 
gramming, and a good programmer may be able to develop their own code to 
exhaust the search space effectively. The current implementation uses idea (2), 
inspired by Smart’s earlier exposition in [34] and is described here in Sect. 6. 
Item (3) paraphrases an interesting idea which is first due to de Weger in the case 
K = Q [12] and which was generalized to arbitrary number fields K for S-unit 
equations arising from Thue and Thue-Mahler equations by Tzanakis and de Weger 
[39, 40, 41]. Wildanger [43] and Smart [37] worked out the details of the full 
generalization, which was later simplified by Evertse and Gyéry [15]. This is an 
extremely promising and potentially effective method of reducing the search space, 
and has been implemented recently in special cases by several people, including 
Koutsianas [23], Bennett, Gherga, and Rechnitzer [6], von Kanel and Matschke [42], 
and others. Future work will certainly focus on including this sieving technique for 
our functions. 

In all these methods, we begin with the same search space as in (1), and the 
computational complexity of a brute force search is easy to estimate. Let B be a 
bound for the maximum absolute value of an exponent in a solution to the S-unit 


2 
equation. Since we are searching for a pair (Tt), T2) € (G% 3) , the size of our 


search space is given by 
|Ax.s/? = w*QB+4+ 1). 


Thus a naive brute force search has complexity O (w?(2B +1)” ) In practice, a 
simple exhaustive search can be carried out by checking, for each element tT; of 
Ax,s, whether | — t; is an S-unit. Assuming this check has constant time for a fixed 
K and S, we get the less extreme complexity of O (w(2B + i); 

In carrying out computations, we find that the resources required to sieve a search 
space vary greatly, even for number fields of the same degree and S-unit groups of 
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Table 3 Runtimes for 


g(x) t |w |N_ | Runtime (in seconds) 
sieve below bound (K, S, 40) q 5) 
_ = x*—x* 41 2 |12 | 16 1.16 
xt 49 2/4) 0) 2.06 
4412x7418 |2 | 2) 0 | 64 


the same rank. For example, we give the run time for three fields Kz, where Sf is 
the set of primes above 3 in Kg, in Table 3. The column WN gives the total number 
of distinct solutions found. In each case, the LLL-reduced bound is below 40, so 
complete sets of solutions are found in each case. Computations were performed in 
a paid account on the CoCalc platform in late 2018. 

The resources required depend on the size of the search space, but also can vary 
greatly based on the particular list of primes Q chosen for the sieve, and even the 
order of those primes! In many cases, the sieve greatly reduces the time required 
to exhaust the space. In others, a brute force search of the reduced search space 
can actually be a better choice, as the sieving computation can take a mysteriously 
long time. Finding a way to understand and predict these difficulties is a priority for 
future work. The implementation of idea (3) could also make this unnecessary. In 
all cases, it is worthwhile to find the smallest reduced bound possible, whether as 
input for the built-in sieve or for use in a brute force search. 


7.2 Finite Place vs. Infinite Place Bounds 


In general, we find that the LLL-reduced bounds corresponding to pg infinite are 
smaller than the bounds for pg finite. To illustrate this, let % be the set of 85 number 
fields K satisfying 


1<[K:Q] <5, Ag = £243", 


If N € Z, we set 


Stin,k,N (= {p © Ox : p | N}, 


Sk,N i= Stin,k,N U Soo. 


For any choice of K € “and S = Sx.y where N € {2,3,6}, we have 
computed the LLL-reduced bounds under the assumption that pg is finite and under 
the assumption that pg is infinite. Complete bound data is available by email request 
to authors Malmskog or Rasmussen. Here we will consider only the case S = Sx. 
Now, let Bj(K) and B2(K) be the bounds obtained in Sect. 5 under the assumption 
that pe is a finite or infinite place, respectively. In Fig. 1, we plot both B,(K) and 
B2(K) against the root discriminant of K (which ranges from 1.74 to 26.56 in .%). 
The bound B,(K) usually exceeds B2(K), on average by a factor of + 3.00. 
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Fig. 1 Bounds B)(K), B2(K) (vertical axis) for S = Sx and K € -% plotted in order of 
increasing pags 


Because the disparity between these bounds is so large, we would prefer to 
use Bo(K). Generally, we have no control over whether p, is finite or infinite. 
However, if S contains only one finite place, a small trick allows us to use B2(K). If 
(t1, 12) € Ox g isa solution to the S-unit equation, note that (4 ; a and (4 : a ) 
are also S-unit equation solutions. We define the solution cycle of tT, to be 


1 1 1 1 
C(t) = ™j,1-%, pL = ’ = . 
TI TI 1- tT 1-t 


The following result is a restatement of [26, Lemma 6.3]. 


Lemma 7.1 Let K be a number field, and suppose S is a finite set of places of 
K containing all infinite places and at most one finite place, (i.e. |Sfin| = 1). Let 
(T1, T2) be a solution to the S-unit equation over K. Then at least one element of 
C(t) belongs to a solution with pe corresponding to an infinite place. 


This implies that under the hypothesis of the lemma, some representative of each 
solution cycle has an exponent vector bounded by B2(K); recovering the entire 
solution cycle from one representative is trivial. Thus, we can determine all solutions 
to the S-unit equation. 
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Fig. 2 R(K) (vertical axis) versus eee (horizontal axis) for K € 4 S = Sx 2, and |Sgn| = 1 


It may seem that the hypothesis of Lemma 7.1—that there is only one finite 
place in S—is a rather specialized condition. However, many interesting arithmetic 
applications involve searching for objects with “good” behavior away from one 
prime p. In such cases, we take S = Sx_». Should p ramify in K, the condition 
|Sfin| = 1 is equivalent to p being totally ramified, and this is not so uncommon 
when [K : Q] is small. Here, with § = Sx 2, the lemma applies for 72 of the 85 
number fields in 

To illustrate the utility of Lemma 7.1, consider the ratio of the sizes of the search 
spaces for two bounds B,(K) and B2(K), given by 


_ w*(2B\(K) + 1) Bi(K)\™ 
RE) = 2B (K) + 2 (Fm) 


This quantifies the potential savings when the better bound may be used. For 
S = Sx, Fig. 2 plots the savings R(K) against the root discriminant of K for the 
72 fields K in “% for which |Sgn| = 1. 


8 Applications 


A major application of solving S-unit equations is in enumerating solutions to 
Shafarevich-type problems, for example finding complete lists of curves of a given 
type with particular reduction properties. The blueprint for this implementation 
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came from Smart’s 1997 enumeration of all genus 2 curves over Q with good 
reduction away from p = 2 [35], building off earlier work with Merriman [27]. 
In 2017, Malmskog and Rasmussen used these methods to determine all Picard 
curves defined over Q with good reduction away from p = 3 [26]. The same year, 
Koutsianas produced a new algorithm that uses solutions to the S-unit equation to 
find all elliptic curves over an arbitrary number field having good reduction outside 
S [23]. In the remainder of this article, we provide some new applications of the 
implementation. 


8.1 Asymptotic Fermat 


Let K /Q be a number field. We consider the nontrivial solutions (a, b,c) € K 3 to 
the Fermat equation: 


Cy ia? +b? +c? = 0, abc #0, p> 3aprime. 


For fixed p, it follows from the work of Faltings that C,(K) is finite, but it is 
reasonable to ask whether (_J e C,(K) is finite or infinite. Finiteness is equivalent to 
the condition that C,(K) = @ for sufficiently large p. We say K satisfies asymptotic 
Fermat if there exists a bound Bx such that p > Bx implies Cp)(K) = ©. 

There are several number fields K known to satisfy asymptotic Fermat: Jarvis- 
Meekin [22] demonstrate that K = Q(V2) satisfies asymptotic Fermat with 
Bx =4. Freitas-Siksek give an explicit family of real quadratic fields of density 
> - which satisfy asymptotic Fermat. They also report that the real quartic field, 


K=Q(v2+ /2) satisfies asymptotic Fermat. 

In [16], Freitas and Siksek find a condition on a totally real field K which 
guarantees that K satisfies asymptotic Fermat. For the remainder, suppose K is 
totally real. Define 


S = {p: panonzero prime ideal of Gx which divides 2}, 
T={peS: fy =I}. 


Theorem 8.1 (Freitas-Siksek) Let K /Q be a totally real number field, with either 
[K : Q] odd or T nonempty. Suppose that for every solution (t, t2) to the S-unit 
equation, there is some p € T such that max{| ordy(t})|, | ordp(t2)|} < 4 ordy (2). 
Then K satisfies asymptotic Fermat. 


Remark 8.1 We note that Freitas-Siksek’s result is actually stronger, and they 
provide additional conditions under which K must satisfy asymptotic Fermat. Also, 
more recent work of Sengiin-Siksek [32] provides similar criteria for arbitrary 
number fields. However, the above formulation is sufficient for our application. 


The reader may recall that Wiles’s classic proof of Fermat’s Last Theorem 
proceeds by taking a hypothetical solution (a, b, c) and noting that the associated 
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Frey elliptic curve is forced to satisfy an impossible set of constraints (that the curve 
is not modular). Freitas and Siksek’s approach is similar. Given a solution to Cp 
over K, they produce an elliptic curve E/K (related to, but distinct from, the Frey 
curve) whose j-invariant is arithmetically constrained. However, the j-invariant is 
determined by the A-invariants of E; these A are guaranteed to arise as solutions to 
the S-unit equation over K. The result above follows from a delicate analysis of how 
these constraints interact. 

We report a new list of cubic number fields K/Q which satisfy asymptotic 
Fermat. Using the implementation of the algorithm described in this paper, we find 
all solutions to the S-unit equation (S as above), and verify the condition of Freitas- 
Siksek (this last step is trivial once all solutions have been determined). 

Let #x denote the set of totally real cubic number fields in which 2 is totally 
ramified and which have absolute discriminant Ax satisfying |Ax| < X. Table 4 
lists all the fields of “y for X = 2000. For each K € -%x, we solved the 
appropriate S-unit equation, and by applying Theorem 8.1, verified that K satisfies 
asymptotic Fermat. Our results are not effective, as Theorem 8.1 does not provide 
the bound Bx. 

For each K € 42000, fx denotes a minimal polynomial for K/Q; Ax is the 
absolute discriminant of K. Because 2 is totally ramified, Lemma 7.1 guarantees 
that every solution cycle will contain a solution with the extremal place pe 
infinite. Consequently, each solution cycle will contain at least one solution (11, T2) 
satisfying 


Tj = p™, laj| < Ky". 


(Finding the remaining solutions in the solution cycle is trivial even if they do not 
satisfy this bound.) 

Finally, N(S, K) indicates the number of distinct solutions (t1, T2) to the S-unit 
equation found. (These are unordered solutions, so that (t), T2) and (T2, T)) are not 


Table 4 Fields in 762000 and fr Ax KtLL N(S, K) 

ectieas - S-unit equation ee re a 25 53 
x3—x?—5x—1 |2?-101 175 | 11 
x3 —x?—5x 43 |2?-3-47 | 156 5 
3_ 6, —2 2?.33.7 | 161 5 
x3 —x?—7Tx—3 | 27.197 156 8 
x3 — 8x —6 2? . 269 176 8 
x? — 10x — 10 27.57.13 | 156 8 
x3—x?—Tx+5 | 27.349 199 8 
xe —x7- 9x —5 | 27.373 162 8 
xe —x?—Tx +1 | 2?-3-127 | 180 2 
x3 —x2—9x +11 | 2?-389 198 8 
3_ 12x —14 2?.34.5 | 164 2 
x3 — 8x -2 27.5-97 | 176 5 
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Table 5 Solutions to (22) 


? q |x k |n \q x k |n 
ae a i | 2lil1| | 4214 
Tass cesar 
17| 2[2 11 li79 |-4 [5 [1 
19 |—2 [3 |1 [251 | 2 (5 (1 
67 | 4a la |307 | 415 |a 

73 |-2 [4 [1 


considered distinct.) The reader should note that the two trivial solutions over Q, 
(—1, 2) and G: 5)» are counted in each field K. 


8.2 Cubic Ramanujan-Nagell Equations 


In 1913, Ramanujan conjectured that the only solutions of the Diophantine equation 
x* +7 = 2” over the natural numbers satisfy x € {1,3,5, 11, 181} [30]. This was 
settled in 1948 by Nagell [28]. The more general family of equations, 


Ax? +B=C", A,B,CeZ 


are called Ramanujan-Nagell equations, and the literature for solving such equa- 
tions is very rich (see for example [10, 9, 11, 7]). Very recently cubic Ramanujan- 
Nagell equations, have attracted the attention of mathematicians [5]. These are 
equations of the form 


f@=C", f(x) € Z[x], deg f =3,C €Z. 
We consider the particular example 
4+3%=q", q>3prime, n,k>0. (22) 
If gq = 2, a more general version of (22) is solved in [5]. Here, we prove the 


following theorem. 


Theorem 8.2 Let q be a prime with 3 < q < 500. All integer solutions of the cubic 
Ramanujan-Nagell equation (22) with k,n > 0 are listed in Table 5. 


Our method also works for the equation x? + p* = q”, where p, q are different 
odd primes, and the proof is similar to the case p = 3. 


Proof Let K be the splitting field for f(x) = x + 3. We observe K is unramified 
outside {3,00}. In fact, K has class number | and is totally ramified at 3. Let 
p = 2@x be the unique prime in K above 3. Let S be the set of all places of 
K above 3, q, or ~. 
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Suppose (q, x, k, n) is a solution to (22). Let 6 be a root of f(x), and let ¢ denote 
a primitive cube root of unity. Define 


a= (a t+cipy, a :sajOx, 0<i <2. 
Then we must have agaja2 = q” and agaja2 = q” Ox. Fori ¥ j, 
on — 0 = CL CII BE eg pk, 


Since (3,¢) = 1, we see ordy a; = O for each i. Also, it follows that the a; are 
pairwise coprime. Thus, if q | g@x, then exactly one a; is divisible by q, and 
ordy a; = n. Now fix i’ € {0, 1, 2} so that ord aj; = n for at least one q | q. 
Choose j’ # i’ and set 


Qj! aj 
Ty = —— Q3= 


5 na | 
jr — Aj Qj — aj" 


Then (71, 72) is a solution to the S-unit equation and ordg t1 = n for some q | q. 
Choose a root of unity oo and a basis »1,..., 0; for the torsion-free part of On s 
Choose b;,; € Z such that 


: L 
Di j 
T= Lp; : 
j=0 


There exists B such that |b;,;| < B. Define 


t t 
C= > lord, pj|, Cq t= lord, Pj|> 
j=l j=l 


and set cg := max{cq : q | g}. By design, 
|2k + 3| < c3B, |n| < cq. 


With these bounds established, the solutions to (22) may now be determined by 


exhaustion. Oo 
As a final remark, we observe that we may choose p so that cz = cg = 1. 
Let q1,...,q¢ be the prime ideals in K above q. As @x is a PID, we may choose 


Ai € Cx such that q; = A; Ox. Let &, & generate the torsion-free part of On. The 
choice p = [/0, &1, 2, 7, A1,.--, Ag] Now gives cz = cg = 1. 
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Computing Classical Modular Forms ®) 
for Arbitrary Congruence Subgroups sei 


Eran Assaf 


Abstract In this paper, we prove the existence of an efficient algorithm for the 
computation of systems of Hecke eigenvalues of modular forms of weight k and 
level T, where PF C SLo(Z) is an arbitrary congruence subgroup. We also discuss 
some practical aspects and provide the necessary theoretical background. 


1 Introduction 


1.1 Motivation 


The absolute Galois group of the rational numbers Gg = Gal(Q /Q) is an object of 
central importance in number theory. It is studied through its action on geometric 
objects, among which elliptic curves and their torsion points have been playing a 
dominant role. Let EF be an elliptic curve defined over Q, and let p be a prime. 
Then Gg acts on its p-torsion points E[p]. Therefore, we obtain a representation 
PE,p: GQ> GL2(Fp). 

In [28], Serre proved that the index [GL2(F») : Pe,p(Gq)] is bounded by a 
constant Cg depending only on LE. Serre went on to ask the following question, 
which remains open to this day, after half a century. 


Conjecture 1.1.1 (Serre’s Uniformity Problem over Q, [28]) Js there a constant 
C > 0 such that for any prime p > C and any elliptic curve E over Q without 
complex multiplication, the mod p Galois representation pE,p : GQ > PGL2(F p) 
is surjective ? 


There has been much progress in working on the conjecture. The remaining and 
most difficult case is to exclude the possibility that pg» has image contained in the 
normalizer of a non-split Cartan subgroup of GL2(F,) for large p. These elliptic 
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curves are classified by a modular curve X;*,(p). Equivalently, one has to show that 
for all large enough primes p, the only rational points of the modular curve X*, (p) 
are CM points. 

Therefore, there is great interest in finding explicit equations for X71, (p) over Q. 

More generally, Mazur’s Program B [20] suggests, given an open subgroup G C 
GL>(Z) to classify all elliptic curves FE such that the image of pg is contained in 
G. To this end, one would like to have explicit equations for the modular curve 
describing those, XG, over Q, in order to search for rational points. 

The general method to find explicit equations for the modular curves Xq is to 
compute g-expansions of a basis of cusp forms in S;([) where Tl C SL (Z) is 
the preimage of G in SL2(Z), and find polynomial relations between them. The 
complexity of this computation is dominated by the complexity of computing the 
Hecke operators {7;,} on the space of modular forms S; (T°). 

Thus, a main cornerstone of many efforts related to Serre’s uniformity conjecture 
and Mazur’s Program B is the computation of Hecke operators and g-expansions of 
modular forms with arbitrary level T. 


1.2. Main Results 


Let G C GL2(Z/NZ) bea subgroup, and let k > 2 be an integer. Then G pulls back 
to a congruence subgroup [ = I'g © SL2(Z) of level N. This paper is concerned 
mainly with the computation of the space of modular forms of weight k and level 
Ig, which we denote by Mz (TG). 

Our main goal is to construct an explicit model for My(I'g) over Q, which 
includes a finite dimensional Q-vector space equipped with a subspace of cusp 
forms S; (IG), and a family of Hecke operators {7,,} acting on it. 

After constructing such a model, we proceed to evaluate the systems of eigen- 
values of the eigenforms in this space, leading to computation of the zeta functions 
of the distinct components of the associated Jacobian. We then proceed to compute 
q-expansions. However, as we shall discuss later, these computations do not bridge 
the gap in the computational aspect of the efforts for computing g-expansions. 

This is achieved most efficiently, under some mild assumptions, using Merel’s 
results [23]. Let us presently describe these assumptions. 

. ’) € GL 2(Z), and denote by Ay : GL2(Z) > 
GL2(Z/NZ) the natural reduction map. 


Definition 1.2.1. We say that G C GL2(Z/NZ) is of real type if 4y(n)Gan(n)7! = 
G. We say that a group TC SL2(Z) is of real type if nPn-! =T. 


We shall make use of 7 := ( 


Remark 1.2.2 The assumption that G is of real type is essential for the Hecke 
operators to commute with the star involution (see Lemma 4.1.8). The slightly 
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Table 1 Summary of complexities for computing Hecke operators 


Algorithm | Tp, p{N Ty, (D(a), N) = 1 To 

Complexity | O(Cdp log p) | O(Cdla,rg log(N D(@))) | O (d (Cla,rg log(N D(@)) + 12 In)) 
To(N) O(dplog p) | O(dp log(Np)) O(dp log(Np) + N? log? N) 

Prs(N) O(dplogp) | O(dplog(Np)) O(dp log(Np) + N*) 
weaker assumption, that Ig is of real type, is necessary for Yr := I'\H to be 
defined over Q. 


In the description of complexities below, we denote by Ig := [SL2(Z) : Tg] the 
index of Ig, and by d := dim S; (MG) the dimension of the space of cusp forms. We 
shall describe the complexity of our algorithms in terms of field operations. 


Remark 1.2.3 Since the size of our matrix entries is bounded linearly by the 
level N, these are essentially logarithmic in N. However, when computing the 
eigenforms, some of the arithmetic is done over number fields, and the complexity 
depends on the algorithms used to implement the arithmetic over those fields. 


In what follows, we assume the existence of a function CosetIndex(x, R, I’) that 
for an element x € SL2(Z), a group I and a list of coset representatives R for 
I'\SL2(Z), returns the unique index 7, such that x € T- R[i]. We will denote by C 
the complexity of CosetIndex. 


Remark 1.2.4 The function CosetIndex could be implemented in general, by 
noting that [g\SL2(Z) = Go\SL2(Z/NZ), where Go := GN SL2(Z/NZ), 
and using the Todd-Coxeter algorithm [35], with complexity of O(/ oh * field 
operations. However, for certain groups I’, there are more efficient implementations. 
For example, if ! = l'9(V), this can be done in O(1) field operations using the fact 
that [\SL2(Z) = P!(Z/NZ). Moreover, by allowing pre-computation of O(N?) 
field operations and memory of O(N°) field elements, we can use a table and access 
it using O(log N) field operations. 


For the convenience of the reader, a table is included in which the complexities 
of the different algorithms for computing Hecke operators are compared to one 
another, and specialized to o(V), Pns (NV) (Table 1). 


Theorem 1.2.5 (Corollary 4.5.11) There exists an algorithm that given a group of 
real type G C GL2(Z/NZ), a prime p { N such that pmod N € det(G) and 
an integer k = 2, computes the Hecke operator T, on the space of modular forms 
Si(0G) using O(C - dp log p) field operations. 


Example 1.2.6 


(1) If © = To(N), then C = O(1). Thus we obtain O(dp log p), the standard 
complexity. 
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(2) If f = Tys(N) is the non-split Cartan subgroup, then C = O(1). Again we 
obtain O(dp log p). Note that here d = dim S;(I°), where standard methods 
cost O(dop log p), with dp = dim Si(To(N2)) > d. 


We also provide an algorithm for computing Hecke operators corresponding to 
arbitrary double cosets. For a matrix a € GL} (Q), we let D(a) = det(a)/d, (a) € 
Z, with d; (a) the greatest common divisor of all the entries of w. Denote also Jy, = 
[Pia Ta AP. 


Theorem 1.2.7 (Corollary 4.3.3) There exists an algorithm that given a group of 
real type G C GL2(Z/NZ), an element a € GL} (Q) such that n~!an € Tgalg 
and gcd(D(a), N) = 1, and an integer k > 2, computes the Hecke operator Ty, 
corresponding to the double coset 'GaVG, on the space of modular forms Sx(1G), 
using O(Cdly rg log(N D(a))) field operations. 


Example 1.2.8 If © = [o9(N) anda = (; ;) with p { N, then Jy,r = p+1,C = 
P 


O(1) and D(a) = p. Thus we obtain O(dp log(Np)), slower than the algorithm 
described in Theorem 1.2.5. However, this algorithm works in greater generality. 


Remark 1.2.9 If G is not of real type, the above algorithm still computes the Hecke 
operator TJ, on the space S;([G) ® Sx (Ig). This, in turn, gives rise to a system of 
eigenvalues, from which one can deduce the zeta function. An example is (group 
8E! in [13]) 


©=((35):(03): (07) Gi) -Ga)ps men 


which is not conjugate to any subgroup of real type. In this case, dim $2(G) = 1, 
and one may compute the Hecke operators using the code as follows. 


> gens <=] [[ 7; 0% Oy 7 J, 2, 3) 3) S Jell-O, Te HT ds 
3, 0, 0; 3 ],1 4; 7, 7% 3 113 

N := 8; 

H = sub<GL(2,Integers(N)) | gens>; 

H := PSL2Subgroup (H_N) ; 

M := ModularSymbols(H, 2, Rationals(), 0); 

S := CuspidalSubspace (M) ; 

> HeckeOperator(S,97) ; 

[18 0] 

[ 0 18] 


VvVvV VV 


We describe a general algorithm to compute the Hecke operators, which works 
for the case gcd(D(a), N) > 1 as well. To describe its complexity, denote the 
complexity of membership testing in G by In. 


Theorem 1.2.10 (Theorem 4.2.10) There exists an algorithm that given a group of 
real type G C GL2(Z/NZ), an element a € GLZ(Q) such that n~!an € Pgalg 
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and an integer k > 2, computes the Hecke operator Ty corresponding to the double 
coset (gal, on the space of modular forms S,(1G), using 


O (a (Clare log(N D(a)) + IzIn)) 
field operations. 


Example 1.2.11 If [ = To(V) anda = i) with p | N, then Ig = 
P 


O(N log N), yielding O(dp log(Np) + N? log” N). One may notice that the second 
term we are picking up, due to the generality, is quite large. 


These algorithms allow us to compute zeta functions efficiently. For a complete 
zeta function, we need also the values of the Hecke operators at primes dividing the 
level. Therefore, we need to assume an additional hypothesis. 


Definition 1.2.12 Let G C GL2(Z/NZ), and let p | N be a prime. Let foal, 
be elements in GL} @Q such that 7, is a linear combination of the Tey,¢° We say 
that the Hecke operator 7, on Sx(I'G) is effectively computable if there exists 
an algorithm to compute a set of such elements {a,j} using O(Cd(p log p + 
kIg log(kIg)) + d?) field operations. 


Example 1.2.13 


(1) When I = I'o(N), Ty is effectively computable for all p | N, asap = (; °) 
P 
can be computed in O(1) field operations. 
(2) When the order generated by I" contains no elements of determinant p, T, = 0 


is effectively computable. 


We further denote by In the complexity, in field operations, of membership test 
inG. 


Corollary 1.2.14 (Corollary 6.3.4) There exists an algorithm that given a group 
of real type G © GL2(Z/NZ) with surjective determinant such that for all p | N, 
T, is effectively computable, and a positive integer L, returns the zeta functions 
associated to each of the factors of Jac(X (UG)) up to precision L~* using 


O(d(C(LlogL + N log N) + NIZ In+Ig logg)) +d?) 


field operations. 


This further allows one to compute g-expansions as follows. Consider the 
inclusion ¢ : Sz,(TG) € Sx((N)), where N is the level of Ig. This inclusion is not 
Hecke equivariant in general. Therefore, our eigenvalues are not the g-expansions 
of forms in S;(T@). 

However, we have an action of GL2(Z/NZ) on S;,(T(N)). Write S,(T(N)) = 
Dic , Vi, with each V; an irreducible GL2(Z/NZ)-representation. Since V := 
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uU(Si(TG)) = Sr(T(N))%, we have V = @Oier V 1 Vi. But in P(N) the q- 
expansions of newforms do correspond to the zeta functions of the appropriate 
factors. Therefore, the g-expansions we have found for a basis of V M V; are the 
q-expansions of newforms in V;, denote them by g;;. By Atkin et al. [3], for each 
i, the space V; is generated by twists of the {g;;}. Thus, we are now led to finding 
the linear combinations that land in V M V;. This can be done either by computing 
the GL(2, Z/NZ)-action on cusp forms, as described in [38], or by using modular 
symbols for (NV), and solving linear equations. This can be done in several ways, 
and is currently in the process of being implemented by Josha Box (see [9]) and by 
the author. 
Thus, obtaining the g-expansions remains highly inefficient for two reasons. 


(1) In order to obtain the g-expansions one has to work in the larger space 
Sk(P(N)). 

(2) The linear algebra required to obtain the g-expansions has to be done over large 
fields, as the twisting operators introduce the Gauss sums of the characters, 
which are elements in the large cyclotomic field Q(éy,, €¢(n))- 


The method illustrated in this paper has been implemented in the MAGMA 
computational algebra system, by building on the existing package of modular 
symbols, implemented in MAGMA [8] by William Stein, with contributions by 
Steve Donnelly and Mark Watkins. The package is publicly available in the github 
repository [2]. 


1.3 Recovering Known Results 


As an application of our algorithm we recover the equation for the canonical 
embedding of X5,(13) in Po as was done in [4]. Effectively, this repeats the work 
done in the original paper, only in an automated fashion. The main difference is that 
the computation of the eigenform in step 3 of Sect. 4 is done directly in the smaller 
space S2(I's,), instead of inside Sz (I"9(169)). 


Corollary 1.3.1 ([4, Theorem 1.8]) The modular curve X 5,(13) is a genus 3 curve 
whose canonical embedding in Po has the model 


Axty — 3x*y? 4+ 3xy? — x3z 4 16x yz — 11xy*z 
+5y>z + 3x777 + Oxyz* + yr dogg? ap 2yz? = 0. 


Another application recovers the main result of [6], namely explicit equations 
for the canonical embeddings of X + (13) and X,(13) in Po: Again, this basically 
repeats the work done in the original paper, only obtaining the result of Sect. 4 ina 
matter of seconds. 
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Corollary 1.3.2 ([6]) The modular curves X**,(13) and X(13) are defined by the 
equation 


(—y — z)x3 + (2y* + zy)x? + (-y? + zy? = 227y + 23x + (2z*y* — 3z7y) = 0. 


Similarly, we are also able to reproduce the results of [21] for x (17), X ce 5(19) 
and XT (23). 


ns 


1.4 New Results 


An interesting problem that requires the computation of such Hecke operators is 
the question of decomposition of Jacobians of modular curves. The factors in these 
decompositions should have interesting arithmetic. We could perform the following 
using our code in 31min. Note that using Chen’s isogeny, one would need to 
compute and decompose the space of modular symbols $2 (I9(977)). 


Corollary 1.4.1 The Jacobian of the modular curve X;*,(97) decomposes over Q 
as the direct sum of 13 Hecke irreducible subspaces, of dimensions 


3,4, 4, 6, 7, 7, 12, 14, 24, 24, 24, 56, 168. 


In particular, it has no elliptic curve factor. 


Remark 1.4.2. Any method of computing modular forms of weight k = 1 relies on 
computation of modular forms of higher integral weights (e.g. [10], [27]), so that 
our results apply equally well. 


1.5 Related Literature 


The theory of modular forms for GL2 has been established for quite some time. 
However, the theory usually restricts to the Iwahori level subgroups, [9(V) and 
1" (N). There are a few exceptions: Shimura, in [29], explores a mild generalization, 
and much work has been devoted to some special cases, such as those groups 
induced by maximal subgroups of GL2(Z/NZ). In this vein, computation has also 
been restricted to the subgroups I'9(NV) and 1’; (NV), as in [32] and in [11]. 

Nevertheless, many new applications arise [4, 5, 16, 34] in which description of 
the space of modular forms M;(I') is needed, when IT C SL 2(Z) is an arbitrary 
congruence subgroup. 
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1.6 Organization 


The paper is structured as follows. 

In Sect. 2, we introduce the definitions and the statement of the problem. 

In Sect. 3, we explain the construction of an explicit model for M;(T) using 
modular symbols, how to construct a boundary map 0, and through it the subspace of 
cusp forms S;(I°). This description of S,(I°) was already known to Merel, see [23], 
but we present a general algorithm for the computation of 0. So far, the literature has 
only discussed the computation of 0 for Iwahori level subgroups. (see [12, 32, 37]). 

Section 4 contains the core of this work, the computation of the Hecke operators. 
In Sect.4.1, we begin by defining the Hecke operators corresponding to double 
cosets. Then, in Sect.4.2, we present a general algorithm for computing them, 
thus proving Theorem 1.2.10. We then present a variant of this algorithm which 
is more efficient, but applicable only to Hecke operators away from the level, thus 
yielding Theorem 1.2.7. In order to define the Hecke operators T,, for integers n 
that are coprime to the level of I’, i.e. (7, N) = 1, in Sect.4.4, we need to view 
the modular curve adelically. We start from the adelic Hecke operators and show 
that they correspond to certain double coset operators that were already defined. In 
Sect. 4.5, we use the techniques of Merel from [23] to prove Theorem 1.2.5. 

Section 5 describes the degeneracy maps, and the old and new subspaces, which 
will be useful when decomposing our space of cusp forms. Section 6 contains 
miscellaneous algorithms needed to compute the zeta functions associated to the 
eigenforms, given the Hecke operators. In Sect.7 we describe several applications 
of our result to contemporary research. 
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2 Setup and Notation 


In this section, we present the basic setup and introduce some notations that will be 
used throughout the paper. 


2.1 Congruence Subgroups 


Let N be a positive integer, G C GL2(Z/NZ) a subgroup, and Gog = GN 
SL2(Z/NZ). Let Ay : M2(Z) — M2(Z/NZ) be the natural reduction map, and 
let Aw,0 : SL2(Z) > SL2(Z/NZ) be its restriction to SL2(Z). 

Then TG := AN'o(Go) © SL2(Z) contains [(N) := ker(Aw.o), hence it is 
a congruence subgroup. Moreover, every congruence subgroup arises this way. In 
general, we will denote by I’ a congruence subgroup of SL2(Z). 

We denote by H the complex upper half plane H := {z € C | Im(z) > O}. 
Then 1 admits a natural action of GLJ(R) via Mobius transformations and we let 
Yr := T'\H be the affine modular curve of level I’. 


Definition 2.1.1 Denote H* = H UP! (Q). The group GLZ(Q) also acts on P!(Q) 
via MGbius transformations, and we let Xp := I'\H* be the modular curve of level 
I’. The points of P\P!(Q) = Xp — Yr are called the cusps of Xr. 


Definition 2.1.2 We say that Ig is the congruence subgroup induced by G. 
Here are several useful examples. 


Example 2.1.3 The following are standard examples of congruence subgroups, 


which will be used throughout. 


(jyrg S ne o lac can 2)*| Then Te = Pn). 


(2) Let G = {(6 ) |a €(Z/NZ)*,be ina}. Then TeEH{Ti@: 


(3) Let G = IG )) |a,d €(Z/NZ)*,be zjwal, Then Pg = To(N). 
(4) Let h be a subgroup of (Z/NZ)”, and let t be a divisor of N. Let 


Ge IG ) laeh,d €(Z/NZ)*,be rziNz}. Then lg =, 2). 
Before presenting a non-trivial interesting example, we define the notion of a 
non-split Cartan subgroup. 


Definition 2.1.4 Let A be a finite free commutative Z/NZ-algebra of rank 2 with 
unit discriminant, such that for every prime p dividing N, the F,-algebra A/pA 
is isomorphic to F p2: Then the unit group A®* acts on A by multiplication, and a 
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choice of basis for A induces an embedding A* <> GL2(Z/NZ). The image of 
A” is called a non-split Cartan subgroup of GL2(Z/NZ). 


Example 2.1.5 Let G be a non-split Cartan subgroup of GL2(Z/NZ). Then [g = 
Pus (NV). Similarly, if 9t(G) is its normalizer in GL2(Z/NZ) then Pyyq) = rN). 


2.2. Modular Forms 
ab 


) € GL} (Q). Let f : 
c 
H — C, where H := {z € C | Im(z) > O}. We define for any z € H 


Definition 2.2.1 Let k > 2 be an integer. Let g = 


fle(z) = (det gk! - (cz + d)~* f (gz). 33) 


Denote by M;(I) the space of holomorphic modular forms of weight k with 
respect to T° (i.e. holomorphic functions f : H — C such that f|, = f for all 
y €T and f is holomorphic at the cusps of ['\H). Denote by S;(I°) the subspace 
of cusp forms (those vanishing at the cusps). 


Since (NV) C IT’, we know that (; ") eT, hence f(z +N) = f(z) for all 


f € Mi(L). It follows that f admits a Fourier expansion f(z) = 7°29 ang”, with 
an € C and gq := er. 


This is now enough to state a computational problem: 


Problem 2.2.3. Given a group G C GL2(Z/NZ), an integer k > 2 and a positive 
integer L, find the g-expansion of a basis for S,(IG) up to precision q”. 


This problem has been answered extensively in the literature for (gc = T'\(N) 
and 'g = I9(N) (see [31, 32, 17, 11]). In these cases, the solution to Problem 2.2.3 
is through the following steps: 


(1) Construct an explicit vector space representing S; (Ig). 

(2) Compute the action of Hecke operators on this space. 

(3) Decompose the action into irreducible subspaces. 

(4) Write down the g-expansion from the systems of eigenvalues in the above 
decomposition. 


Among these steps, step (3) and (4) follow from (2) by linear algebra techniques. 
Moreover, the Hecke algebra of S;(G) is generated by the double coset Hecke 
operators {7,} (see Corollary 4.1.4). Therefore, we concentrate our efforts on the 
following problem. 


Problem 2.2.4 Given a group G C GL2(Z/NZ), an integer k > 2 and an element 
ae GLZ(Q) compute the matrix of the Hecke operator 7, with respect to a basis 
of S,(T@). 
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We address this problem in full in Sect.4, and present several solutions—one 
general solution with no additional assumptions, and under some mild assumptions, 
we have a more efficient solution. 

Unfortunately, in the general case step (4) does not follow from step (3) in the 
same way. The reason is that the Hecke operators acting on S,(IG) have no longer 
a nice description in terms of the Fourier coefficients. For an example of such a 
description, see [38]. 

We can and will show that the Hecke operators we construct correspond to the 
Frobenius action on the curves, hence the system of eigenvalues of any eigenform 
f does yield the zeta function of A, the corresponding factor of the Jacobian. 


3 Explicit Computation of $;,(T) 


First, we have to construct a model for the vector space S;(I°). We will do so through 
the means of modular symbols. For the results of this section, it suffices to assume 
that T’ is a finite index subgroup of SL2(Z). Therefore, in this section, we will 
suppress the notation of the group G ing. 


3.1 Modular Symbols 


Let Mp be the free abelian group generated by the expressions {a, b}, for (a, b) € 
P!(Q) x P!(Q,, subject to the relations 


{a,b} + {b,c} + {c,a}=0 Va, b,c € P'(Q 
and modulo any torsion. Let My := Sym‘~? Z? @ Mb, where Sym*~? is the 


representation of GL2 of highest weight k — 2. 
There is a natural left action of GL2(Q) on Mx by 


g(v @ {a, b}) = gv ® {ga, gb}. 
Definition 3.1.1 Let M,(T) = (Mk)r := Mk/ (yx —-x) be the space of T- 
coinvariants, the space of modular symbols of weight k for T (over Z). The space of 
modular symbols of weight k for T over a ring R is 


Mi (0; R) := My (LP) @z R. 


The reason we are interested in the space of modular symbols M;(I°) is the 
following theorem proved by Manin: 
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Theorem 3.1.2 ({19, Theorem 1.9]) The natural homomorphism 
yg: Mo2(l) > A (Xr, cusps, Z), 


sending the symbol {a, b} to the geodesic path in Xy between Ya and Tb, is an 
isomorphism. 


This will give us the connection to modular forms. We now turn to the reason 
that modular symbols are useful for computation. 


3.2. Manin Symbols 


Definition 3.2.1 Let g € SL2(Z). The Manin symbol [v, g] € Mi (L) is defined as 


Lv, g] = g(v @ {0, oo}). 


ca(3): (2). GY) 


We have a right action of SL2(Z) on Manin symbols as follows: 


Let 


[v, glh =[h~'v, gh]. 


The following theorem gives an explicit recipe for constructing a concrete 
realization of M; (I) as a free module of finite rank. 


Theorem 3.2.2 ({23, Propositions 1,3]) The Manin symbol [v, g] depends only on 
the class Vg € Y\SL2(Z) and on v. The Manin symbols {[v, 8 becsty(Z),vesym'222 
generate M; (I). Furthermore, if x is a Manin symbol, then 
x+xo =0, 
xtxttx0r= 0, 


x-xJ=0. 


Moreover, these are all the relations between Manin symbols, i.e. if B is a basis 
for Sym'-?Z? then 


Ie 


My (I) DBs Z-[b, 81] /R 


beB,geT\SL2(Z) 
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where R are the above relations, and torsion. 
This allows one to compute a model for M; (I’) efficiently. 


Corollary 3.2.3. There exists an algorithm that given a finite index subgroup T © 
SL2(Z) and an integer k > 2, computes a basis of Mx(T) in O({SL2(Z) : T]) basic 
CosetIndex operations. 


Proof This was implemented by Stein for T = '9(V) in [32], and the general case 
is similar. oO 


Remark 3.2.4 The implementation in [32] makes use of the fact that 
To(N)\SL2(Z) = P!(Z/NZ). This allows him to perform CosetIndex operations 
in O(1) field operations. This is the only difference from the general case. 


Definition 3.2.5 The module Sym*~*Z? has a basis consisting of homogeneous 
elements, namely {x ae er It follows that M;,(I) has a basis consisting of 
elements of the form [x” y‘~?-”, g]. From now on, we assume all given bases to be 


of this form, and say that the weight of a basis element [x” y*~?-”, g] is w. 


3.3 Modular Symbols with Character 


Suppose T C I’ C SL (Z) are finite index subgroups such that T is a normal 
subgroup of I’ and let Q := I'’/T.. Then Q acts on M, (I) via 


(y'T) -[v, g] = [v, y’g]. (3.3.1) 


Let ¢ : OQ > Q(¢)”* be a character, where ¢ = ¢, is an n-th root of unity, and n is 
the order of ¢. Abusing notation, we will denote the induced character on I’ by ¢ as 
well. 

Let M; (I, €) be the quotient of Mi (1; Z[¢]) by the relations 


y’-[v, g]—e(y’)-[v, g] =0 
for all x ¢ M; (1; Z[¢]), y’ € Q, and by any Z-torsion. 
Note that Mj, (I, €) has a basis consisting solely of elements of the form [v, g] 


where g € I’’\SL2(Z). Then if Q is abelian, this action factorizes to the isotypic 
components: 


M,(r) = QB Mi (I, €). (3.3.2) 
ceO 


This decomposition allows for faster computation by implementing Mi (I°) as 
the direct sum of these smaller spaces. 
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3.4 Cuspidal Modular Symbols 


oo 


Let By be the abelian group generated by the elements {a}, a € P!(Q). Let By = 
Sym*-27Z2 ® Bo. We have a left action of GL2(Q) on Bx via 


g(v @ {a}) = gv ® {ga}. 


Let B, (1) := (Bx)r. This can be viewed naturally as a space of “boundary 
Manin symbols”, using the following boundary map. 


Definition 3.4.1 The boundary map 0 : My(T) — Bx(P) is the natural map 
extending linearly the map 


a(v @ {a, b}) = v@a—v@b. 


Again we have a theorem that allows us to find a nice description of this module 
as a free module of finite rank. 
Theorem 3.4.2 ((23, Propositions 4,5]) Let R be the equivalence relation on 
T'\Q? given by 


[T(Au, Av)] ~ signa) [Pr (u, v)] 


for any € Q*. Let w: BYP) > QL(P\Q’)/R] be the natural map given by 


H(pe lef) =Pmo-tran 


where P € Sym‘~*Z? is considered as a homogeneous polynomial in two variables 
of degree k — 2 and u, v are coprime. 


Then wt is well defined and injective. Moreover, 
wo A([P, g]) = PCL, Og, 0)] — PO, DIF g@, 1]. 


The space S;(I’) = ker d is called the space of cuspidal modular symbols. As in 
Sect. 3.3, we can define By (I, ¢), a boundary map d : My (T, e) > Bx(T, €) and its 
kernel will be denoted by Sx (I, €). 

In order to have a concrete realization of S,(, €), it remains to efficiently 
compute the boundary map. 


3.5 Efficient Computation of the Boundary Map 


In this subsection, we show how to compute the boundary map efficiently. For finite 
index subgroups T ¢ I’ € SL(Z) such that I is a normal subgroup of I’, and a 
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character ¢ : T’/T > Q(¢)*, we denote c,([) = dim B2(T, €). We will proceed 
to prove the following result. 


Theorem 3.5.1 There exists an algorithm that given groups T C YT’ © SLo(Z) 
such that T is a normal subgroup of V’, a character ¢ : T’/T — Q(¢)*, and an 
integer k > 2, computes a basis for Sx (1, €) in O([SL2(Z) : T’]-ce() +[SL2(Z) : 
1']) basic CosetIndex operations. 


The proof of Theorem 3.5.1 will occupy this entire subsection. We will obtain 
the basis by computing the kernel of the boundary map. Thus, given My (I, ¢), we 
would like to compute the boundary map po 0: My (Tl, ¢) > B(l, €). 

The idea follows Cremona and Stein [12, Prop. 2.2.3, Lemma 3.2, Prop. 8.13], 
[32, Alg. 8.12, Prop. 8.13, Alg. 8.14] in that we do not have to compute the space of 
cusps for I’ a priori. Instead, for every cusp in P!(Q) that we encounter, we check 
whether it is equivalent to the ones previously encountered. 

Following that route, we should be able to check whether a cusp is equivalent to 
0 (note that this is not the cusp 0, but rather the neutral element of the module 
By (I, €)) or to a previously encountered cusp. We will begin by describing the 
algorithms for testing cusp equivalence and cusp vanishing, and conclude by 
presenting the algorithm for computing the boundary map. 

These algorithms are implemented in [32] and in [12] for © = o(V), FCN). 

In order to treat the general case, we first prove a criterion for the equivalence. 


Proposition 3.5.2 Leta,b € P!(Q). Letl € SLo(Z) bea subgroup. Let ga, &b € 
SL2(Z) be such that gq(oo) = a and gp(oo) = b. LetT = & ), Then T\SL2(Z) 


admits a right action by T, via multiplication, and .a = [Tb if and only if Vga, Tgp 
lie in the same (T)-orbit of '\SL2(Z). 


Proof If there is some y € I such that ya = b, then ygg(co) = gp(ow), and so 
8) Y8a € Stab s1,(z)(00) = (T). 


Therefore we get 2, Va = T” for some integer n € Z. Thus, there exists n € Z 
for which g,7” € I'gq. The converse follows similarly. Oo 


Thus, in order to test for equivalence, we have to compute the T action. This can 
be done a priori for all cosets, as we proceed to show. 


Definition 3.5.3 Let [ C SL2(Z) be a finite index subgroup, and let R = 
(r1,...,1n) be coset representatives for "\SL2(Z). Let S = ((01, €1), ..-, (On, €n)) 
be pairs of integers. We say that S is a (T)-orbit table of T with respect to R if the 
following two assumptions hold. 


(1) The cosets I'r; and Ir; lie in the same (T)-orbit iff 0; = 0;. 
(2) For alli, j such that 0; = 0;, we have r; - T° ae el. 
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Lemma 3.5.4 There exists an algorithm that given a finite index subgroup TC 
SL2(Z), and a list of coset representatives for '\SL2(Z), R, computes a (T )-orbit 
table of T with respect to R in O([SL2(Z) : T]) basic CosetIndex operations. 


Proof Apply Algorithm 3.5.5. It just goes over the cycles, hence has linear running 
time. Oo 


Algorithm 3.5.5 OrbitTable(I’, R). Constructing the (T)-orbit table. 
Input : 


* a finite index subgroup [ C SL2(Z). 
¢ alist of coset representatives R = (r1,...,1rn) for !\SL2(Z). 


Output : a list S, which is a (T)-orbit table of with respect to R. 
(1) fori € {1,...,h}do 


(a) s; = (0,0) 
(b) t; := CosetIndex(r; - T, R, T) 


(2) i:=lo:=1,e:=0 
(3) while i < hdo 


(a) sj := (0, e) 
(b) i:= fj 
(c) if s; 4 (0, 0) then 


Gi) o:=o4+1; 
(ii) e := 0; 
(iii) while i < h and s; 4 (0, 0) do 


(A) i:=it+t1; 
(d) else 
(i) e:=e+1 
(4) return (sj,..., 5p) 
And then our algorithm for cusp equivalence testing is rather simple. 
Algorithm 3.5.6 CuspEquiv(I, R, S, a, b). Test for cusp equivalence. 
Input : 


¢ a finite index subgroup l C SL2(Z). 

¢ alist of coset representatives R = (r),...,17n) for !\SL2(Z). 

e an orbit table S = ((01, €1),..-, (On, en)) (the output of Algorithm 3.5.5). 
° abe P'(Q). 


Output : If there exists y ¢€ I such that ya = b, then (true, y), else 
(false, 1). 
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(1) Find ga, gp such that gg(0o) = a, gp(O) = b. 

(2) ig := CosetIndex(gq, R, 1), ip := CosetIndex(g,, R, 1) 

(3) ifo;, A o;, then return (false, 1); 

(4) return (true, gp - T%a “i « g~!) 
Corollary 3.5.7 There exists an algorithm that given a finite index subgroup TC 
SL2(Z), a list of coset representatives R for '\SL2(Z), the (T)-orbit table of T 
with respect to R, and a,b € P!(Q), returns whether .a = Tb, in which case 
it returns also an element y € Y such that ya = b, in O(1) basic CosetIndex 
operations. 


Proof Apply Algorithm 3.5.6. By Proposition 3.5.2, it returns the correct answer. 
oO 


Finally, to test whether a cusp vanishes, we use the following proposition (an 
analogue of [32, Prop. 8.16]). Recall that if f C I’ is a normal subgroup, and 
QO =I"’/T, then Q acts on My (TL) as in (3.3.1). 


Proposition 3.5.8 Let CT’ C SL2(Z) be finite index subgroups such that T is 
anormal subgroup of T'. Let Q :=1'/T, and let e : Q > Q(¢)* be a character. 
Suppose a € P!(Q) is a cusp. For x € P'(Q,, write [x] for the equivalence class of 
x inT\P!(Q). Then a vanishes modulo the relations 


[yx] =e) - Lx] Vy’ el’, xe PQ) 
if and only if there exists q € Q with e(q) 4 1 such that [q - a] = [a]. 
Proof First suppose that such a q exists. Then 
[a] = [q -a] = ¢(q) - [a]. 


But e(g) ¥ 1, hence [a] = 0. 
Conversely, suppose that [a] = 0. Because all relations are two-term relations 
and the [’-relations identify ’-orbits, there must exist a, 8 € T’’ such that 


[w-a] = [B-a] 


and e(a) 4 e(B). Indeed, if this did not occur, we could mod out by the ¢ relations 
by writing each [a - a] in terms of [a] and there would be no further relations to kill 
[a]. Next observe that 


[p-'a -a] = e(B~!)- [a -a] = £(8)'[B -a] = [a] 


so that if g € Q is the image of B-!a € I’, then e(g) = e(B)!e(a) £1. oO 


This gives rise to an algorithm for checking whether a cusp vanishes. 
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Algorithm 3.5.9 CuspVanishing(T, ¢, 5, R, S, a). Test for cusp vanishing. 
Input : 


* a finite index subgroup [ C SL2(Z). 

* acharactere: Q =I’/T > Q(é)*. 

* asections : Q > I’ of the quotient map. 

* a list of coset representatives R for P\SL2(Z). 
¢ an orbit table S (the output of Algorithm 3.5.5). 
* aeP'(Q). 


Output : If [a] = 0 in B; (I, €) return true, return false. 


(1) For each g € Q such that e(q) 4 1 do: 


(a) equiv, y := CuspEquiv(T, R, S,s(q)-a,a). 
(b) If equiv then return true. 


(2) return false. 


Corollary 3.5.10 There exists an algorithm that given a finite index subgroup T C 
I’ C SL2(Z) such that T is normal in Y’, a character ¢ : T'/T > Q(¢)%*, a list 
of coset representatives R for [\SL2(Z), a (T)-orbit table of TY with respect to R, 
and a cusp a € P!(Q), returns whether [a] = 0 in B,(T, €) in O({T’ : T]) basic 
CosetIndex operations. 


Proof Apply Algorithm 3.5.9. The correctness follows from Proposition 3.5.8. 0 


Relying on Algorithms 3.5.6 and 3.5.9, we may now formulate an algorithm 
to compute the boundary map M;(T,¢) — Bx,(T,¢). In the algorithm below, 
CuspIndex is a function that given a finite index subgroup [ C SL2(Z), a list of 
coset representatives R for [\SL2(Z), a (T)-orbit table of [ with respect to R, a 
cusp a and a list of cusps C, returns an index 7 such that a is equivalent to C; in 
By, (I) and an element y € I’ such thata = y -C;. Ifa ¢ C, it appends a to C. This 
function is obtained by using Algorithm 3.5.6 repeatedly. 


Algorithm 3.5.11 BoundaryMap(T, IT’, ¢,k, B, R, R’). Computation of the 
boundary map. 


Input : 


* finite index subgroups T C I’ C SL2(Z) such that T is normal in I’. 
¢ acharactere : T’/T > Q(¢)*. 

* anintegerk > 2. 

¢ B= (b,,..., bn), an ordered basis of Manin symbols for Mi (I, e). 


* R=(r,...,/rh), coset representatives for [\SL2(Z). 
° R’=(r},...,1),), coset representatives for I’\SL2(Z). 
Output : 


* amatrix A. 
¢ alist of boundary Manin symbols C. 
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such that A represents the boundary map pz o 0 with respect to the bases B, C. 


(1) C:=@ 

(2) A:=0€e Q(ey”*N 

(3) S := OrbitTable(T, R) 
(4) S’ := OrbitTable(I’, R’) 
(5) fori € {1,2,...,n}do 


(a) Write b; = [P, g] 
(b) w := weight(b;) 
(c) ifw=k-—2: 


Gi) a:=g-@ 
(ii) if not CuspVanishing(T, ¢, 5, R, S, a): 


(A) j, y, C := CuspIndex(I’, R’, S’, a, C) 
(B) Ajj = Aij tel (y) 


(d) if w =0: 


(i) a:=g-0 
(ii) if not CuspVanishing(T, ¢, 5, R, S, a): 


(A) j, y, C := CuspIndex(I’, R’, S’, a, C) 
(B) Ajj = Ay — e'(Y~) 


(6) return (A are Cc 
Remark 3.5.12 Note that when k = 2, we execute both (c) and (d). 
We can now proceed to prove Theorem 3.5.1. 


Proof (Of Theorem 3.5.1) The computation of S,(I, €) is performed by using 
Corollary 3.2.3 to construct a basis B of modular symbols for Mx (I, ¢), and then 
applying Algorithm 3.5.11. 

The computation of the (T)-orbit tables for both T and I’ using Algorithm 3.5.5 
costs O([SL2(Z) : ']) basic CosetIndex operations by Lemma 3.5.4. 

The function CuspIndex is implemented by applying Algorithm 3.5.6. The check 
whether a cusp vanishes is done by applying Algorithm 3.5.9. By Corollaries 3.5.7 
and 3.5.10, these algorithms return the required output in O(1) basic CosetIndex 
operations. 

Since there are 2 - [SL2(Z) : T’’] Manin symbols of either maximal or minimal 
weight, and we have to check equivalence of at most each of these against each of the 
cusps in the list, which has at most c,(I") elements in it, it follows that the running 
time for Algorithm 3.5.11 is O([SL2(Z) : T’]-ce(1)+[S$L2(Z) : T]). Thus, we have 
an algorithm for computing S,(T, €) in O([SL2(Z) : T’]- ce(T) + [SL2(Z) : T]) 
basic CosetIndex operations. oO 
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3.6 Pairing Modular Symbols and Modular Forms 


Let Sy (T) = {f : f € Sx(T)} be the space of antiholomorphic cusp forms. We then 
have a pairing 


(,-) 2 (Se(P) ® Se(P)) x My (P) > C 


given by 


B B 
(fi, fe), P @ ta, B}) = i A@PE Dae% i f@PEVdz 3.6.1) 


The following theorem is the final ingredient needed to get a handle on the actual 
spaces of modular forms. 


Theorem 3.6.2 ({30, Theorem 0.2], and [23, Theorem 3]) The pairing 
(-.-) 2 (Sk) ® SP) x S(P, C) > C 


is a nondegenerate pairing of complex vector spaces. 


This is still not completely satisfactory, as we would like to separate the 
holomorphic forms from their antiholomorphic counterparts. 


3.6.1 The Action of Complex Conjugation 


—10 


Definition 3.6.3 Let 7 = ( ai 


) € GL2(Z). If n normalizes T, i.e. nn = T, 
we say that I is of real type. 
We then have the following result. 


Proposition 3.6.4 (Merel, [23, Proposition 7]) Let ! C SL2(Z) be a subgroup 
of real type. Then there is a complex linear involution t : Sx(T) ® S(T) > 
Si(P) ® Sx(V) which exchanges S,(V) and S(T), given by u(f)(z) = f(—Z). 
The involution W : My(T) > My(P) defined by 


WY (v @ {a, B}) = —nv ® {na, np} (3.6.5) 


is adjoint to t with respect to the pairing (3.6.1). Moreover, t’ acts on Manin symbols 
via 


WY (vu, gl) = —Inv, ngn7']. 
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We may now state the final result about the pairing, which explains how modular 
symbols and modular forms are related. 


Theorem 3.6.6 (Merel, [23, Proposition 8]) Let S,(I°)t be the +1 eigenspace for 
* on Sx(L). The pairing (3.6.1) induces a nondegenerate bilinear pairing 


Se)? x (LT) > C. 


We denote by c(I’) = dim B2(T) the number of cusps of I’. Then it follows that 


Corollary 3.6.7 There exists an algorithm that given a finite index subgroup TC 
SL2(Z) and an integer k > 2, computes a basis for S;(0) in O([SL2(Z) : T]-c(1)) 
basic CosetIndex operations. 


Proof In order to compute S;(I")T, it is necessary to replace B,(I") by its quotient 
modulo the additional relation [(—u,v)] = [(u, v)] for all cusps (u, v). Algo- 
rithm 3.5.11 can be modified, as in [32], to treat that case by changing CuspEquiv to 
check equivalence with both (u, v) and (—u, v). Thus, the complexity described in 
Theorem 3.5.1 is valid also for the computation of S;(I’, ¢)*. In particular, taking 
Tl’ =T and ¢ to be the trivial character, we obtain the result. oO 


4 Hecke Operators 


Now that we have a realization of S;(Ig), we would like to be able to compute 
Hecke Operators on this space. First, let us recall a few basic facts about Hecke 
operators in general. 


4.1 Hecke Operators on M;(T) 


Definition 4.1.1 Let ! C SL2(Z) and suppose A C GL; (Q) is a set such that 
TA = AT =A, and I\A is finite. Let R be a set of representatives for '\ A. Let 


Ta: M(T) > ML) 


P= Do ie 


aceR 


where f|q is the usual right action of Gre (Q) given by (2.2.2). The operator T, is 
called the Hecke operator associated to A. 


It is a standard result that this operator is well-defined and independent of R. 
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Proposition 4.1.2 ((29, Prop. 3.37], [14, Chapter 5.1]) The image of Ty lies in 
M,(0), and Ty does not depend on the choice of R. 


Definition 4.1.3 The algebra generated by the {7,}, when A runs over all subsets 
of GL} (Q) such that TA = ATP = A is the Hecke algebra of Mx (I). 


Fora € GL} (Q), we denote Ty = Trar. 


Corollary 4.1.4 The Hecke algebra of M;(1) is generated by the operators 
(TohveGLt(Q): 


Proof It is a consequence of the definition that A must be a union of double cosets 
Tal, where a € GL} (Q). In particular, the Hecke operators are linearly spanned 
by the {T,}. oO 


Example 4.1.5 Here are two standard choices for A. 


(1) If fF = 1 {(N), one usually considers for any n € Z, the set 


Al = {s € Mp(Z) | det(g) =n and g = & *) mod n| 
n 


One usually denotes T, = T,1. We note that A} = I (; ") I’, so that J, = 
n n 


Ty, with a = f . 
On 


(2) Let l = T(N). We may take for n € Z, the set 


An = {« € M2(Z) | det(g) =n and g = (5°) mod v| : 


Note that again we are taking A, = T (; °) Yr. 
n 


As our implementation of modular forms is by using the dual space of modular 
symbols, we need the dual notion. 


Definition 4.1.6 Let A C GL2(Q) be such that TA = AT = A and such that 
I'\A is finite. Let R be a set of representatives of \ A. Let 


TX >: Mk (T) > Mg (P) 


v ® {a, B} +> } (uv) @ {5a 5B}. 


5ER 


Again, this map does not depend on the choice of R. We then have the following 
result. 


Computing Classical Modular Forms for Arbitrary Congruence Subgroups 65 


Proposition 4.1.7 (Merel, [23, Proposition 10]) The operators Ty, and TX are 
adjoint with respect to the bilinear pairing (-, -) defined in (3.6.1). 


Therefore, by Theorem 3.6.2, in order to compute the restriction of the Hecke 
operators T, to S,(I) ® S;(L), it will suffice to compute the Hecke operators TX on 
Sx (I). Note that in order to compute the restriction to $;(I°) using Theorem 3.6.6, 
we would need 7, to commute with the map 1, defined in Proposition 3.6.4. 


Lemma 4.1.8 Let C SL2(Z) be a subgroup of real type (recall Definition 1.2.1). 
Leta € GL Q) be such that n~'an € Pal. Then in My(V) we have Ty oY = 


Vv Vv 
WoT’. 


Proof Let R be a set of representatives for P\C aI. For each 6 € Tal, we have by 
the assumptions on I and a that 


nid E€ n {Ton = (n 'Tn) n lan (n 'T) = Tn lonl =Tafl. 
Therefore, we see that 
Tel= | | Pé=n7! (i rs) n= | | n'Pén= |_| (n-'Tn) nén= | | Tn dn, 
deR 5eR 5eR 5eR 5eR 
so that R’ = {n~!6n | 5 € R} is also a set of representatives for [\Tal’. Now 
Ty ( (v ® (a, B})) = Ty’ (—nv ® {na, nB}) 


= SY \(-dnu ® {dna, dnB}) 


deER 

= Don! 8n)v ® {nq 5n)ee, nq '5n)B}) 
5ER 

= DY) n8'v ® {nd'a, 5B) = (Ty (v ® {a, B})). 
5/ER’ 


Oo 


Corollary 4.1.9 Let G C GL2(Z/NZ) be a subgroup of real type (recall 
Definition 1.2.1). Let « € M2(Z) be such that An(a) € G. Then in My(TG) we 


Vv rr Vv 
have TY ot” =i o TY. 


Proof Since G is of real type, we have 
aw(ngn |) = An(n)Gay(n)~! = G. 


It follows that 7G n! C Vg, and hence Ig is of real type. 
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Also, as Aw (a) € G, we see that 
an(n'an) € Av(n)|Gan(n) = G. 


Moreover, since det(a) = det(n~!an) and d\(a) = d\(n~'an), where d is the 
greatest common divisor of all the entries, w and n~!an have the same Smith normal 
form, showing that SL2(Z) - a - SL2(Z) = SL2(Z) - n~!an - SL2(Z). We invoke 
[29, Lemma 3.29(1)], noting that Ay (ga) = G = Ay (al G), and so, in Shimura’s 
notation, a € ®. We deduce that 


Tgalg = {g € Mp(Z) | SL2(Z)-@- SLx(Z) = SL3(Z)- g - SL2(Z) and Aw(g) € G}. 


It then follows that n~!an € 'galG. The result follows from Lemma 4.1.8. oO 


4.2 Naive Computation of T,’ 


We will now present a naive algorithm to compute the Hecke operator T,’ on My (T°) 
for an arbitrary a € GLI Q). 


Algorithm 4.2.1 HeckeOperator(a, x, B). Hecke Operator T,Y on M,(T). 
Input : 


* aeGL3(Q. 
* anelement x = v @ {a, B} € M,(T). 
¢ abasis B for the space of modular symbols My (T). 


Output : a vector representing 7,” (x) w.r.t. the basis B. 


(1) Let WH :=T na !Ta. 

(2) Let S be a set of representatives for H\T. 

(3) LetR:=a-S={a-x:x eS}. 

(4) Return ae riz(r - x)]B, where [z(r - x)]g is the vector representing the 
modular symbol ['- (rv) @ {ra, rB} w.rt. B. 


Remark 4.2.2 Even though this is unclear at first sight, the functions (T, P'2) te 
Ty NT and (P, x) x—'Tx also require a nontrivial implementation. 

Note also that l C SL2(Z) is defined using G C GL2(Z/NZ). Therefore, 
ifa@ € GIy (Q) is such that a~'Ta Z SL2(Z), a straightforward computation 
could be expensive (either find N such that N - a-'Ta C §L2(Z) and change 
computations to fit, or work in GL} (Q) with generators—that requires finding the 
generators of I"). Moreover, as GL} (Q) is not finitely generated, we do not have 
ready made tools for computing the intersection TN a~!Ta. 
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This has brought us to consider the following alternate path: 
Strategy for Computation of Algorithm 4.2.1 


(a) Implement the function (1, x) bh x—'Tx for x € GL} (Q) when x~!Tx C 
SL2(Z). 

(b) Implement an intersection function (1, [2) BH Ty AP? for, T2 C SL2(Z). 

(c) Write the set of representatives [\TaT = a- (((@—'Te) ‘a r) \r) as a 
sequence of operations of type (a), (b). 


Here is an implementation of the intersection in step 1, using (a) and (b). 
Below, we denote by '°(N) the congruence subgroup Ig, where 


Gea i) la,de GINO)" ce ZINE}. 


Algorithm 4.2.3 ConjInter(I’, w). Conjugation and Intersection. 
Input : 


¢ I C SL 5(Z) a congruence subgroup. 
* a € GLZ (Q) such that a'Ta C SL2(Z). 


Output : 7 Na7!Ta. 


(1) Find matrices x, y € SL2(Z) such that x - a7! - y is of the form & 7 


with a~'b € Z (Smith Normal Form) 
(2) Let H:=TN(a7!- (TN (y-T(a7!b)- y~!)) -@). 


First, let us prove that Algorithm 4.2.3 indeed returns the correct answer. 
Proposition 4.2.4 Algorithm 4.2.3 returns the group T Na7'Ta. 


Proof First, note that if a_'b € Z, then 


-l 
a0 aQ — p0,,-1 
() sia) (40) nSta@) = Pa b). 


Therefore 
y tax !SL2(Z)xa~!y MN SL2(Z) = T°(a7!d). 
But x € SL2(Z), so that x~!SL2(Z)x = SL2(Z). It follows that 
aSLo(Z)a Ny -SLo(Z)y 7! = yP°(a by}. 
But y € SL2(Z), so ySL2(Z)y~! = SL2(Z). It follows that 


aSL2(Z)a~! A SLo(Z) = yV°(a7!b)y 7. 
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Therefore 
aSL7(Z)a-! AP = aSL2(Z)a7!N SLo(Z) NT =P Nya! b)yA!, 
so that 
SLo(Z) Na Ta = a7} (rn yP°(a!byy!) a. (4.2.5) 
Finally, intersecting with I’, one obtains 
eTenr=—rn (a7! (r n yP°(a~'byy~") 9) =A 


| 


In order to be able to use the strategy 4.2, we should also show that the 
conjugations we are computing are in SL2(Z). 


Lemma 4.2.6 We have yl°(a~'b)y~! © SL2(Z) and a“! (FN yF°(a~!b)y“!) 
a © SL(Z). 


Proof The first is trivial, as y € SL2(Z). The second follows from (4.2.5). oO 


Remark 4.2.7 Algorithm 4.2.1 can be used to compute [7,’]g by applying it to 
each of the basis vectors. 


It remains to show how to compute the conjugation in (a), and the intersection in 
(b). For the conjugation in (a), we compute the generators of using Farey symbols 
(see [18, Theorem 6.1]). We then conjugate them to obtain the generators of the 
conjugated subgroup, and can compute the level and the reduction of this conjugated 
subgroup. For the intersection, we present the following algorithm. 


Algorithm 4.2.8 InterInduced(G, H). Intersection of induced congruence sub- 
groups. 


Input : 


* GC GL7(Z/NGZ). 
© H CGLz(Z/NyZ). 


Output : a group K C GL2(Z/NZ) such that 0k = Tg NT y. Here N = 
Icm(Ng, Ny). 


(1) d := gcd(Nog, Ny), Ga := {g € G| g = 1 mod d} and Hg := {h € H | 
h = 1 mod d}. 

(2) Kg :=Aa(G) NAq(A), and sg : Kg > G,sy: Ka — H sections of Ag. 

(3) Let Ag € GL2(Z/NZ) be aset of matrices a lifting Generators(Gqg) such 
that a = 1 mod Ny. (Chinese Remainder Theorem) 

(4) Let Ay © GL2(Z/NZ) bea set of matrices a lifting Generators(Hg) such 
that a = 1 mod Ng. (Chinese Remainder Theorem) 
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(5) Let Ag = {az : z € Generators(Kqa)} © GL2(Z/NZ) be such that a, = 
sG(z) mod NG and az = sy(z) mod Ny. (Chinese Remainder Theorem) 
(6) Return K := (Ag UAqg U Ag). 
Proposition 4.2.9 Algorithm 4.2.8 returns a group K such thatlx =TqGOTuH. 


Proof First note that for any p { N, and any e, Ape(TG Ay) = GL2(Z/p*Z). 
Then note that 


Aw(Pg AH) = {x € GL2(Z/NZ) | Ang (x) €G, Any (x) € H}. 


We will show that K = Ay(Tg NT y), hence the result. Let x €Ay(TgG ONT yg). 
Then Ag(x) € Kg. Then Ag(x) = Tike: for some {z;} C Generators(Kq). Write 
dj := az,, and consider a = |] a It satisfies Ag(a) = Ag(x), hence Ag(a~!x) = 1. 

In particular, we see that Ay, (a~!x) € Gg anda Nu (a~!x) € Hg. Therefore 
there are {xj} © Generators(Gq) and {yx} © Generators(Hq) such that 
Awe (a'x) = tee Any (a'x) = T] ye*. Let {bj} be the lift of the {x;} in Ag 
and {cx} the lifts of the {y,} in Ay. Then 


ano (TOY Tet) = 20's 


and 


tae (TTF TT) = annem 


hence 


x=] [aj Tye T] et EK. 


oO 


In order to measure the complexity of Algorithm 4.2.1, in addition to the 
CosetIndex operation, we use In to denote group membership test in G. We will 
also need some notation, which we now introduce. 

Fora € GL}(Q, let d} (a) € Qo be maximal such that a € d,(a@) - M2(Z). 

Let D(a) := det(a)/d (a) € Z, and let Jy,p := [T: @7'Ta nr). 

This leads us to the main result of this subsection. 


Theorem 4.2.10 There exists an algorithm that given a congruence subgroup of 
real type TY C SL2(Z) of level N, an element a € GL (Q) such that n~!an € Tal 
and an integer k > 2, computes the Hecke operator Ty corresponding to the double 
coset Tal, on the space of cusp forms S;(U), in complexity 


O(C - Iy,r log(N - D(a)) + [SL2(Z) : PP - In). 
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Proof We apply Algorithm 4.2.1, using Algorithm 4.2.3 to perform step (1). The 
conjugation function is computed by finding a set of generators for I’, using Farey 
symbols (see [18, Theorem 6.1]), and conjugating them. Intersection is computed 
using Algorithm 4.2.8. This computes the dual Hecke operator 7,’ on Mi(I). 
Since I is of real type, and n~!an € TaY, by Corollary 4.1.9 it follows that Ty’ 
commutes with .Y. This shows that it induces an operator on S,(I°)*. Therefore, by 
Theorem 3.6.6 and Proposition 4.1.7, we obtain Ty. 

The cost of step (1) is dominated by computing the Farey symbols, which yields 
the cost O([SL2(Z) : TJ’). 

In step (4), for each element in the basis [v, g] € B, we compute g{0, oo}, to get 
the modular symbol v @ {a, b}. Then, for every r € R apply r to get rv @ {ra, rb}, 
and then use continued fractions to convert back to Manin symbols. This is done 
Typ times. 

Since g € [\SL2(Z), it can be represented by an element in SL2(Z) whose 
entries are bounded by N*. Further, we can choose the representatives r € R up to 
a multiplication by a scalar, hence we can bound its entries by N* - D(a). 

It follows that the entries of r - g are bounded by N* - deta, and so computing 
continued fraction expansion (Euclid’s algorithm) has O(log(N - D(q@))) steps, for 
each of which one has to perform a CosetIndex operation to find the corresponding 
element in the vector space. 

Thus, the algorithm has complexity 


O(C - Iw log(N - D(a)) + [SL2(Z) : PP - In). 


4.3 Faster Implementation of the Hecke Operator T,Y 


Algorithm 4.2.1 is still quite slow in general, since the conjugation function might 
be very slow, as it depends quadratically on the index. However, under some 
simplifying assumptions we may use the following algorithm for the conjugation 
of an induced congruence subgroup. 

Here, we assume that (D(a), N) = 1. 


Algorithm 4.3.1 ConjInd(G, a). Conjugation of an induced congruence subgroup. 
Input : 


* GCGLz(Z/NZ). 
* ae GL} (Q) such that (D(a), N) = 1. Write n := D(a). 


Output : a group H C GL2(Z/nNZ) such that Py = a~'TGan SL2(Z). 


(1) Find x, y € SL2(Z) such that xay = d\(a) 0 ; 
O n-d(a) 
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(2) Let Hy := Ay (di (a)~!a)~!Gay (di (a)~!a) C GL2(Z/NZ). 

(3) Let Hy = An(y)An(M)) any)! © GL2(Z/nZ). 

(4) Let Ay © GL2(Z/nNZ) be a set of matrices a; lifting Generators(Hy) 
such that a; = 1 mod n. (Chinese Remainder Theorem) 

(5) Let B, © GL2(Z/nNZ) be a set of matrices b; lifting Generators(H,) 


such that b; = 1 mod N. (Chinese Remainder Theorem) 
(6) Return H := (Ay UB,). 


Proposition 4.3.2 Algorithm 4.3.1 returns a group H such that Vy = a~'Tgan 
SL2(Z). 


Proof First note that replacing a by d\(a)~'a, we may assume a € M>(Z) and 


d\(a) = 1. Let P’ := a7 !Pgan SL2(Z). If p { nN, then Ape (I’) = SL2(Z/p°Z) 
(for all e). We also have 


an (0’) = An(@)7'Gan(@) 


and if xay = ({ ) then 
n 


-1 
Ae) Ino yhe € ») Ane Texan (j °) dnl. 
On On 


But, since (n,N) = 1 and x-'Tgx D T(N), we have Ay(x~!TGx) = 
SL2(Z/nZ), and so we see that P= {g € SL2(Z) | An(g) € An, An(g) € Hy}, 
with H, and Hy as in Algorithm 4.3.1. Since (n, N) = 1, we claim that [’ = Ty, 
or equivalently that A, (T’) = A. 

Indeed, if g € A,w(T’), thendA,(g) € H, andAw(g) € Hy. Therefore, there exist 
{xi} © Generators(Hn) and {y;} © Generators(Hy) such that An(g) = er 
and Aw(g) = [| re Let {a;} be the matrices in A, lifting the {x;} and {b;} the 
matrices in By lifting the {y;}. Then by the Chinese Remainder Theorem, we have 


Next, we would like to understand the running time of Algorithm 4.2.1, with this 
improvement. 


Corollary 4.3.3 Let a € GL}(Q) be such that (D(a),N) = 1. Then Algo- 
rithm 4.2.1 has complexity of 
O(Ia,r - log(N* - D(@))) 


basic CosetIndex operations. 


Proof Using Algorithms 4.3.1 and 4.2.8 in step (1), we see that the main contri- 
bution to the complexity of Algorithm 4.2.1 comes from step (4), and this was 
computed in the proof of Theorem 4.2.10. Oo 
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4.4 The Hecke Operators T,, 


The question of computing the Hecke operators T,,, when (n, N) > 1, is a difficult 
one. We have not been able to find in the literature a good reference even for the 
definition of these operators. Therefore, we restrict ourselves, for the time being, to 
n such that (n, N) = 1. 


4.4.1 Adelic Definition 


The only definition given in the literature of the Hecke operator for completely 
general level structure is in adelic terms. 
Therefore, before we state the definition, we need to set up some notation. 


Definition 4.4.1 Let G C GL2(Z/NZ). We introduce 
—— {r -g-s|r€Q*,g €GLo(2),s € GL} (R), g mod N € cI © GL2(A) 


where we use the identification Z /N 227, /NZ. 


Now we may define the Hecke correspondences on the corresponding Shimura 
variety. 


Definition 4.4.2 ((29, Section 7.3]) Let G C GL(Z/NZ). Let XG := 


GLI Q\H x GL>(Q))/Sg be the Shimura variety of level Sg. Let w € GL>(Q). 
Let W = Sg NwSgw!. Then w induces a natural correspondence XG > Xgq via 


Ingle So iz, ga). 


aeSg/W 


We denote this correspondence by X (w). 
Lemma 4.4.3 The correspondence X (w), for w € GL2(O), is well defined. 


Proof First, the choice of representatives for S/W does not matter: if we replace 
each a by awa, we would get 


[z, gaWaw] = [z,gawW- WwW Wew] = [z, gaw] 
» PS 2» 


aeSg/W aeSg/W aeSg/W 


because w-!Ww C Sg. Ify € GL; (Q), then X(w)[z, g] = X(w) yz, yg]. Also, 
if we consider an element [z, gs], with s € Sg, it will map to 


Y= kz. gsewl= >> [z, gow] 


weSg/W aeSg/W 
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since sq still runs through a set of representatives of Sg/W. Oo 


We now connect it to our previous definitions of moduli spaces via the following 
Lemma. 


Lemma 4.4.4 ((24, Lemma 5.13, Theorem 5.17]) Let G C GL2(Z/NZ). Let C 
be a set of representatives for 


GL} (Q)\GL2(Q)/Sg = Z/ det(Sg) = (Z/NZ)* / det(G). 
Then 


GLE@\ (Hx GO) /Sc = L] Pe\H 


geC 


where T', = gSgg in GLI (Q). 


Under this identification, if we denote by w the canonical line bundle on XG, 
then we may identify M;,(G) with the space of global sections H (Xe, w®), 
The correspondence X (w) induces (by pullback) a map on global sections of line 
bundles T(w) : H°(Xg, w®*) > H°(XG, w®) given by fb Daesswlaw)* f. 
We then get the following corollary. 


Corollary 4.4.5. The correspondences X (w) may be identified via this isomorphism 
(explicitly given as z +» [z,1]) as correspondences Tg\H — Tgw\H. In 
particular, if det(w) € det(Sg), then we may identify the correspondence X (w) 
as correspondences on 'G\H, and the operator T (w) as an operator on M;((G). 


The reason for using Definition 4.4.2 is the following important theorem. 


Theorem 4.4.6 (Shimura, [29, Theorem 7.9]) Let w € GL2(Q) be such that 
wi = | foralll 4 p, and Wy € M2(Zp) is an element such that det(wp) = p. Then 
if p { N, the correspondence X(w) on Xgq is rational over the field of definition of 
XG and satisfies X(w) = Fry +! Fre o [det(w)—!], where X X is the reduction 
modulo p and Fr, is the Frobenius correspondence on XG x x 


This, in turn, relates these Hecke operators to the zeta function of XG. Before 
stating the connection, we introduce some notations. 

Let Zp(s; XG/Q) denote the local factor at p of the zeta function of XG. Let 
we GL(Q) be such that w, = | for all? # p and wy € M2(Z,) be such that 
det(w,) = p. Leto € SL2(Z) be such that Aw(o) = p-Ay (w)~2. Via the diagonal 
embedding SL2(Z) — GL2(Q), the correspondence X (co) is well defined and as 
det(o) = | € det(Sg), it induces an operator T(o) on My (TG). 


Corollary 4.4.7 Let w,o be as above. Let T(w), T(o) be the corresponding 
operators on Mz(UG). Then 
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(1 =p) = p!*)Zp(s; XG/Q) = det (1 — T(w)- p~* + T@)- p'-**) 
(4.4.8) 


Proof This is proved as in [29], Corollary 7.10 and Theorem 7.11. Oo 


Although this, in general, does not yield an application to g-expansions, there 
are some special cases in which it does. We describe these cases, as we shall want 
to make use of them in some of our applications. 

Recall that the Hecke operators commute, hence have common eigenvalues. 
These are called eigenforms. 


Definition 4.4.9 An element f € S;x(I'G) is called an eigenform if it is an 
eigenvector for Ty for alla € GL. An eigenform f = °°, ang” is 
normalized if a, = 1. 


Corollary 4.4.10 Let [Gg = T'(b, 0), and let f = Yang" € Sy(TG) be a 


normalized eigenform. Let w € GL2(Q) be such that w; = 1 for alll 4 p and 
wp € M2(Zp) be such that det(w») = p. Then T(w) f = apf. 


Proof This is proved in [29, Section 3.5]. oO 


Yet as another consequence, we get the following fact, which could also be shown 
directly by double coset computation. 


Corollary 4.4.11 Jf p { N, andw é€ GL>(O) is such that w; = 1 for alll ¥ p, 
and Wp € Mz(Zp) is an element such that det(w,) = p, then the operator T (w) is 
independent of the choice of w. 


This motivates the following definition. 


Definition 4.4.12 If p { N, and w € GL2(Q) is such that w; = | for all/ 4 p, and 
wp € M2(Z,) is an element such that det(w,) = p, we define T, := T(w). 


Next, we would like to write this operator in terms of a double coset operator, 
as in Definition 4.1.1. In order to do that, we show first that the subgroup W in 
Definition 4.4.2 is also induced from a subgroup of GL2(Z/NZ). 


Lemma 4.4.13 Let G C GL2(Z/NZ). Let p t N. Let w € GL2(Q) be such that 
w, = | for alll 4 p, and wp € M2(Zp) is an element such that det(wp) = p. Let 
W = SgN wSgw7!. Then one has 

p+ Mo(Zp) S M2(Zp) 0 wpMr(Zp)w;,'. 


Let Rp be the image of Mz(Zp)wpM2(Zp)w;! in M2(Z/pZ), and let Gp := Ry. 
Set 


Gy := G X Gp © GLo(Z/(Np)Z). 


Then W = Sg,,- 
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Proof First, let us show that 
p- Mz(Zp) © Mz(Zp) N wpM2(Zp)w;'. 


Indeed, it is enough to prove that p-M2(Zpy) © wypM2(Z p)Wp | which is equivalent 
to 


wy |p» Mz(Zp) + wp S Mp(Zp). 


However, as det(w,) = p, we see that we! - p= Wp € M2(Z,), thus the claim is 
trivial. 

Now, let x € W. Then, as x € Sg, we may writex =r-g-s withr € Q*, 
g € GL2(Z), s € GL} (R), and g mod N € G. 

Write g = g?g,, with g? € Ilizp GL2(Z)) and gp € GL2(Zp). Recall also that 
w, = 1 for alll 4 p. Since x € wSgw7!, we have 


‘wwe SG. 


r-g?- wel gpwp-S=r-w 'gw:s=w 
Thus, there exists r’ € Q* such that 
r'.gP. Wy | SpWp € GL)(Z). 


In particular, for all? 4 p, we have r’- g7 € GL2(Z;), and as g; € GL2(Z)), we see 
that r’ € Z;*. Also, as 


r’. we | &pWp € GL2(Zp) 


and gp € GL2(Zp), we see that r’ € {£1}, and w,! gpwp € GL2(Zp). 
Therefore, we obtain 


Sp © WpGLr(Zp)w| © wpM2(Zp)w;,'. 


As we already know gp, € M2(Zp), we see that g, mod p € Rp. 

Since gp, € GL2(Z,), it follows that g mod p = gy mod p € Gp, and from 
g mod N é G, it follows that g mod Np € G x Gp = Gy. Thus we have shown 
that x € Sgy, so that W C SG,,. 

Conversely, assume x € Sc,,. Thenx =r-g-s withr € Q*%,ge GL2(Z), 
s € GL} (R), and g mod Np € Gy. 

It follows that gmod N € G and gmodp € Gp. In particular, we see 
immediately that x € Sg. Moreover, 
1 


w xw=r-gl. Wy SpWp°s. 
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Since gp mod p = g mod p € Gp C Rp, we have 
8p € Mz(Zp) N wpM2(Zp)wz,| 


hence we! &pWp © M2(Z,). Looking at the determinant, we see that it actually lies 
in GL2(Z,). In particular, 


g =g?-w,'gpwp € GL2(Z) 


and g’ mod N € G. Therefore, w~!xw € Sg, showing equality. oO 


We now show how to use strong approximation to rewrite the operator T(w) in 
the classical description of modular curves. 


Lemma 4.4.14 Let G C GL2(Z/NZ) be a subgroup. Let p € det(G) be a prime. 
Let w € GL2(Q) be such that w; = | for alll 4 p, and Wp € M2(Zp) be such that 
det(w,) = p. Then the correspondence X (w) can be realized as 


Tgzrh ~~ Tg-q(w)-a7!z 


aclg/TGE, 


for some q(w) € Gly (Q), explicitly constructed as a function of w. Therefore Ty 
is realized as f > Deere Pes Ff lqqw)-a-!- 


Proof Let 5) € G be an element such that det(5,) = p. Let wy € M2(Z) be such 
that det(w,) = p, and let a ¢ Tg. Then 


det(S5' - A (wp)) = det(5(p))~! - det(wp) = p7! p= 1 


so that 55! -AN(Wp) € SL2(Z/NZ). Since An : SL2(Z) > SL2(Z/NZ) is 
surjective, there exists B, € SL2(Z) such that 


An (Bp) = 85! «An (wp). 


Let g(w) = By - pws! € GL. Then we have 


a-g(w)!=p! -awpB,! =p !-aw- w'wpB,- 

Let s = ea Then s, = We! WpB;! = pe € SLo(Z) © GL2(Z,). Also, 
for all # p, one has s; = wp ‘ey so that det(s;) = det(w,) - det(B,)~! = p and 
Sp € M2(Z) © M)(Z)), showing that s; ¢ GL2(Z)). Moreover, 


Aw(s) = An (wp): Aw (Bp) | = 5p €G 
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showing that s € Sg. Thereforeaw =a- q(w)7! -s—!, so that (recall that p€eQ* 
acts trivially) 


[z,aw] =[z, po! -aw-s] =[z,a-q(w)'] = [q(w)- a! +z, 1]. 


By Lemma 4.4.13, W = SGy, hence 
Sg/W =G x GL2(Z/pZ)/Gyw =VG/TG,.- 


It follows that under the isomorphism [z] +> [z, 1], the correspondence X (w) above 
is interpreted as 


TGzre > TG -q(w) -a!z, 


acelG/TE, 


4.4.2 Classical Definition 
We can now use Lemma 4.4.14 to find an equivalent definition in classical terms of 
the Hecke operator Tp. 


Lemma 4.4.15 Let a € M2(Z) be such that det(a) = p and dy(a) € G. Then 
Ty = Ty. 


Proof Let w € GL(O) be such that w, = 1 for all! 4 p, wp € M2(Zp) 
and det(w,) = p. By Lemma 4.4.14, the operator Tp, = T(w) is given by 
f > Voeere/re, flacw-p-'> Where q(w) € GL} (Q). By Lemma 4.4.13, we 
know that Gy = G x Gp, with Gp = R, where R, is the image in M2(Z/pZ) of 
My(Zpy) WpM2(Zp)w;', therefore 


T'¢/TG, = (G x GL2(Z/pZ)) /Gy = GL2(Z/pZ)/Gp. 


Recall also that det(w») = p, hence wy € SL2(Zp) - ( ) - SLo(Zp). 


Write wp = 1 - e - y2, for some y1, y2 € SL2(Zp). Then 


=i 
= 0 = 0) = 
WpMz(Zp)w,," =Y- « ) -2M2(Zp)y3 | - G ) yj! 


-1 
po po | 

cas - Mo(ZLp) - 
Y1 ¢ 1 2(Zp) ay Y 
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hence 


M2(Zp) 0 wpM2(Zp)w,' = 1+ A%(p)- yy! 


where A9(p) = Ke ) € My(Zp) |be py), and passing to the image in 
c 
M)(Z/pZ), we get 


Gp = Rp =F -V°L/pL)-77" 


where ['°(Z/pZ) is the Borel subgroup of lower triangular matrices, and 7, € 
SL2(Z/pZ). 

Also, if Bp € SL2(Z) is such that An (Bp) = Anu (p)7! -An(a) > AN (wp), then 
An (Bp - pw,') = )n(a) anda'Bp- p- we € T(N) CI, so that PB, - pw! = 
Ta, thus [Bp - pw,'T = Tal, and so (as Ta(f) = Vser\ror Fils), we may 
assume that Bp - pw! =a, hence q(w) =a. 


po 


Note then that, as p - a-! = Y- iC 1 


) : yBp'. we see that 


Ap(a~ Por) = Ap(o7! » Mx(Z) -@)* = Ap(i- A%(p)- v7!) 
= Ap Tp) 4!) = 71 -PZ/pZ)- 77! = Gp. 
Therefore 
T/( Na7'Pa) = GL2(Z/pZ)/Gp =TG/Te,. 
But we have bijection 
rir na 'fe) > (Ne !Pa)\P > P\Par 
Bre Ba Bt. 


Thus, the map Bt a- B7!:Tg¢/ TG, > P\laf is a bijection, so that 


Ta f= Do fls= Do flapt= Do flacwy.p-t = TA) = TS) 
def\lal BeEVG/TGy BETG/TGy 
establishing the result. Oo 


Corollary 4.4.16 Assume p { N. Let « € M(Z) be such that det(a) = p and 
An(a) € G. Then Ty is independent of a. Moreover, if Gg = V(b,t) and f = 
yr ang” is an eigenform of the Hecke algebra then Ty f = apf. 
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This allows us to define the Hecke operators at primes not dividing the level (and 
by multiplicativity to all n such that (n, N) = 1). 


Definition 4.4.17 Let n be such that (n, N) = 1 andn é€ det(G). Leta € M2(Z) 
be such that det(a) = n and Ay (a) € G. Let T, := Ty. 


Therefore, Algorithm 4.2.1 can be used to compute the Hecke operators 7;,, when 
n is coprime to N. 


Remark 4.4.18 Note that the definition of the Hecke operators {7}, although 
independent of the choice of representatives, does depend on G, and not only on 
Ig. This is not just an artifact of the proof, as the following example shows. 


Example 4.4.19 Let 


Gi = {+ & °) €GLiiZ/iZ) | de @/72)"| 


Let G2 C GL2(Z/7Z) be the subgroup 


G2:= (+ & a) © GL2(Z/TZ) 


which is an abelian group isomorphic to Z/6Z @ Z/2Z. 

Then Tg, = lg, = I'(7) - {£1}. But while ji on $)(I'(7)) is the familiar 
10 
02 


Hecke operator Ty for a = ( ), jt on S2(I'(7)) is simply the identity, since 


Qa ¢ Gp. 
Remark 4.4.20 As in [29, Proposition 3.31], one can show that the map 


Tol b SL2(Z)aSL2(Z), 


for all a € M2(Z) such that Ay(a@) € I3t(G), defines a homomorphism on the Hecke 
algebras. 

However, as the above example shows, this homomorphism is not injective, and 
elements a in different cosets of G\St(G) give rise to different operators. 


4.5 Efficient Implementation of the Hecke Operators T,”, 
n € det(G) 


Algorithm 4.2.1 is not as efficient as we would have liked. Specifically, the 
logarithmic factor obtained from the passage to modular symbols is, in practice, 
due to the constant, a particularly high cost. We therefore use the ideas of Merel in 
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[23] to calculate, at least the operators {T,,} more efficiently. Following Section 2.1 
in [23], we introduce the definition of a Merel pair. 


Definition 4.5.1 Let A C GL2(Q) be such that PA = AT and such that \A is 
finite. Let A = {g € GL2(Q) | & := g !det(g) € A}. Let@: A- SL2(Z) > 
SL2(Z) be a map such that 


1. For all y € A- SL2(Z) and g € SL2(Z) we have: (yg) =T- o(y)-g. 
2. Forally € As SL2(Z), we have y -o(y)— eA (or equivalently @(y)-y € A). 


3. The map P\A > A- SL2(Z)/SL2(Z) which associates to 5 the element 
5SL(Z) is injective (it is necessarily surjective). 
We say that the pair (A, ¢) is a Merel pair for T. 


We can now state Merel’s condition (Cq): 


Definition 4.5.2 Let )° uyM € C[M2(Z)], and let (A, ¢) be a Merel pair for I. 
We will say that >> umM satisfies the condition (Ca) if and only if for all K ¢€ 
ASL2(Z)/SL2(Z) we have the following equality in C(P!(Q)]: 


Y= um ((Moo] — [M0]) = [oo] — [0]. 
MeK 


Finally, we recall the following extremely useful theorem. 


Theorem 4.5.3 ({23, Theorem 4]) Let P € Cy_—2[X, Y] and g € SL2(Z). Let 
(A, @) be a Merel pair for T. Let 1) uyM € C[M)2(Z)] satisfy the condition (C q). 
We have in My (T) 


TX(P,BD= = > umIP lig O(gM)1. 
M,gMeASL3(Z) 
We also recall Merel’s condition (C,,). 


Definition 4.5.4 Denote by M2(Z), the set of matrices of M2(Z) of determinant n. 
We say that an element }° yy umM € C[M2(Z),] satisfies the condition (C,,) if for 
all K € M2(Z)n/SL2(Z), we have in C[P!(Q)] 


Y= um ((Moo] — [M0]) = [oo] — [0]. 


MeK 


Corollary 4.5.5 Let (An, ¢n) be a Merel pair for T with Ay CG M2(Z)y. Let 
Yu umM satisfy the condition (Cy), then in Mx (T) 


TX ((P, gl) = > uml P liq. n(gM)1 


M 


where the sum is restricted to matrices M such that gM € Ags L2(Z). 
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Example 4.5.6 Consider the following examples. 


(1) Let 
ab 
An = {e= (45) €Ma@ [dene =n. Lew 1a—1] 


and dy : A,SL2(Z) > SL2(Z) is amap such that 2(@n(g)) = (0: 1)-Aw(g) € 
P!(Z/NZ), where x : SL3(Z) — P!(Z/NZ) is the natural surjection. Then in 
[23, Lemma 1], Merel shows that (Ay, @,) is a Merel pair for I} (V), which is 
key to modern efficient implementation of Hecke operators. 


(2) Let p > 2 bea prime, and let u € ps be a non-square. Let 


Ym 


b 
An = {e= (24) [det(e) =m p|a—d, pb uel 


and gn, : A,SL2(Z) — SL2(Z) is a map such that dn(g)-!- fu = 
date) * - ./u, where the action of A, SL2(Z) on F 2 — F, is via Mobius 
transformations. Then similarly (Ay, ¢@,) is a Merel pair for I,s(p)—the 
nonsplit Cartan subgroup of level p. 


Let us write Go := GN SL2(Z/NZ), and let w : SL2(Z) > Go\SL2(Z/NZ) 
be the natural map z(g) = Go - Aw(g), inducing an isomorphism ['\SL2(Z) = 
Go\SL2(Z/NZ). Let s : Go\SL2(Z/NZ) > SL2(Z) be a section of z. 

Let 6, € G be an element such that det(6,) = n, and let 

An = {a € M2(Z) | det(a) =n, An(a)€ G}. (4.5.7) 
Let dn : M2(Z)n — SL2(Z) be the map defined by 
$n (a) = s(Go-n'3n - An(@)). 


Proposition 4.5.8 (A,, ¢,) is a Merel pair. 
Proof First, note that a € A,, if and only if 

det(w) = det(@) =n andndy(a)~! = An(@) € G. 
Therefore 


hy {a € M>(Z) | det(w) = n,n Aya) € c} 


Next, we verify the three conditions in Definition 4.5.1: 
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1. Let y € A, SL2(Z) and g € SL2(Z). Then 


1(on(yg)) = Go-n'dn-AN(¥g) = Go- nbn « An(y) - AN(g) 
= 1(gn(V)) An (g) = Go-An(Gn(y)) - An (g) 
= Go An (On(y)g) = T(Gn(V)8) 


hence I’: dn(yg) =T- bn(y)g. 
2. Let g € A,SL2(Z). Assume that n~'3, - Aw(g) € Go. Then n~!Ay(g) € G, 
hence g € A,. Thus g € A, iff n—'8,, -An(g) € Go. Now, note that 


Go-n'8n - An(g) = 1(bn(g)) = Go - AN (Gn(B)) 


hence 


nbn > ANn(gdn(g) |) € Go 


showing that g@,(g)~! € An. 7 7 
3. Let 61,62 € A, be such that 5155! € SL2(Z). By definition 47 (61), An (62) € 
G, hence 


Aw (618) /) = An(6; | -n-n7!- &)) = 
= An (5; 152) = Aw (61)! - Aw (52) € G 


hence 5155! el. 
a] 


Corollary 4.5.9 Assume n € det(G). Let }°yumM< satisfy the condition (Cy). 
Then, in Mx (UG) 


Ty (LP, g}) = Yo uml P lig. bn(@M)I- (4.5.10) 
M 


Proof By Definition 4.4.17 and (4.5.7), we see that T,” = T,Y for any a € An, ie. 
T,’ = ie The corollary now follows from Corollary 4.5.5, once we realize that 


M2(Z)n = Ty, ; SL2(Z). 
Indeed, for M € M2(Z),, consider 
g =M-¢n(M)' = M-s(Go-n'3n-Ay(M))". 
Then 


An (g)~! = An (bn(M))-AN(M)—! € Go-n!8(n)-AN(M)-An(M)~! = Go-n7!6, 
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hence 
n—'hn(g) € 8n- Go SG 


showing that g € fh oO 


Corollary 4.5.11 Computation of the Hecke operator T, on Sx((G), for p € 
det(G), can be done in O(p log p) basic CosetIndex operations. 


Proof In [22, Proposition 8], Merel shows that one can compute a certain set S, 
of O(o;(n) - logn) matrices, which in [23, Proposition 18], he proves to satisfy 
condition (C,,) (Definition 4.5.4). Therefore, by Corollary 4.5.9, it is enough to 
compute, for a vector [P, g] € M,(I), the sum (4.5.10), implying the resulting 
complexity for computation of the operator TY on M; (IG). 

Let a € M)(Z) be such that det(a) = p and Ay (a) € G. Then by definition 
Ty = Ty. Since Ay(@) € G and G is of real type, by Corollary 4.1.9, one can 
then compute ne on Sx(UG). Finally, from Proposition 4.1.7 and Theorem 3.6.6, 
the result follows. oO 


Remark 4.5.12 In general, the computation of the Hecke operators {T,,} for 
(n, N) = 1, can be done by using their multiplicativity property, so Corollary 4.5.11 
is all that’s needed, and gives a slightly better complexity of O( pmax - logn) where 
Pmax 1s the largest prime factor of n. 

Note also that this gives an improvement by a factor of log(N) over Algo- 
rithm 4.2.1, which is very significant in practice. 


We would also like to mention that under some mild assumptions, the decom- 
position 3.3.2 is Hecke-equivariant, hence one can compute the Hecke operator on 
each of the isotypic subspaces separately. 


Lemma 4.5.13 Let G C G’ © GL2(Z/NZ) be such that G is normal in G'. Let 
€:G'/G > Q(o)* be a character, and let « € M>(Z) be such that det(a) = p is 
a prime and dy (a) € G. Then the subspace S,(UG, €) is invariant under Ty. 


Proof Let f € Sx(UG, €), and let y € Tq. Let R be a set of coset representatives 
for 'g\l gal g. Then 


Ly=>_ t=), Ci baa Eo Day So fy 


5eR 5eR 5eR deR 

Since y € Ig and G is normal in G’, for any x € Ig we have Ay(y~! 
hence y—lxy € Tg. Thus y'Tey =TG. 
As Teal = Llser V'G6, it follows that 


xy) EG, 


Vey layle =y 'Tealey =|] vy 'Tesy = |_|] Tey 'sy. 
5ER dER 
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However, since y € Ig and Ay (a) € G, it follows that A) (y~ lay) € G, and so 
by Shimura [29, Lemma 3.29], we see that 


Vey ‘'aylG = {g € Ma(Z) | det(g) = p and Ay(g) € G} =TGala. 


It follows that Ty(f) = DVser fly-lsy> so that Ty (fly = e(v) - Ta(f), hence the 
result. oO 


Remark 4.5.14 Note that it is not enough to demand that Ig will be normal in 
Iq’. For example, if 1? | N and t | 24, then, as described in [25], there is an element 
u; € N(To(N)) defined over Q(¢,) which commutes only with the Hecke operators 
Tp such that p = 1 mod ¢ (equivalently, p splits in Q(¢,)). Indeed, u; does not 
normalize the group G C GL2(Z/NZ) appearing in Example 2.1.3 (3). 


5 Degeneracy Maps 


Our next step would be to decompose our spaces of modular forms to irreducible 
modules for the Hecke algebra. This should be possible once we apply our 
algorithms from Sect.4. The only obstacle in our path to obtain the eigenvectors 
lie in the fact that we still do not have a multiplicity one result due to the existence 
of oldforms. We therefore make a small detour, and describe the generalization to 
our situation of the degeneracy maps, and the decomposition to spaces of newforms. 


5.1 Petersson Inner Product 


Before we describe the degeneracy maps themselves, in order to obtain a good 
notion of oldforms, we briefly recall the definitions and properties of the Petersson 
inner product, so that we will consequently be able to define newforms. 


Definition 5.1.1 Let [ CG SL2(Z) be a congruence subgroup. The Petersson inner 
product, 


G)r i SP) x S(P) > C, 


is given by 


1 _ 
(f, g)r = al f(t) g(t) (Im(z))* duc(r) (5.1.2) 
T Jxp 


where dut = ae is the hyperbolic measure on H and Vr = f Xr dt is the 


volume of Xp. 
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The following standard results then help us to see that it is indeed a well defined 
inner product, and to compute adjoints of Hecke operators with respect to this inner 
product. 


Proposition 5.1.3 ([14, Section 5.4 and Proposition 5.5.2]) Let Tl C SL2(Z) be 
a congruence subgroup. The integral in the Petersson inner product (5.1.2) is well 
defined and convergent. The pairing is linear in f , conjugate linear in g, Hermitian- 
symmetric and positive definite. Leta € GL; (Q), and set a! := det(a)- aq. 
Then 


(1) Ifa-'Toa © SL2(Z) then for all f € Sx (UV) and g € Si(a—'Pa) 


(Flos 8)a-Ta = (f, Sla’)r- 


(2) Forall f,g € S(T), 


(Tu f: 8) = (f; Ta’8)- 
In particular, if a~'Ta = T then the adjoint of f > f\lq is g +> gla and in 
any case TX = Ty. 


The correspondence between a and a’ forces us to consider some more operators 
on this space. 


Definition 5.1.4 Let G C GL2(Z/NZ), and let Yt(G) be its normalizer in 
GL2(Z/NZ). Then the quotient Q := PgxG)/TG acts naturally on M;, (IG) via 
(q): Mee) > Mk(Te) 


f © fila for any a € M9xG) witha = q mod Tg. 


This operator is called a diamond operator. 


For any irreducible representation p : Q > GL, ; (C), we let 
Mi(VG, p) = p ® Homa (p, Mk(VG)) > MTG) 
be the p-isotypic component of M; (1). Then My (TG) = i; M,(VG, p) where p 


runs over all irreducible representations of Q. 


Lemma 5.1.5 Let a € M2(Z) be such that \n(a) € G, and let g € ToxGq)/Ta. 
Then Ty © (g) = (q) 0 Ty. 


Proof Denote by g € I'sxc) an element such that g = g mod Ig. Since Ay (a) € 
G and Ay (q) € It(G), we see that 


Aw (Gag) = An (G) An (@An(@G) € ANG 'Gan@ =G. 
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Moreover, det(@~!aq) = det(a), so that Ay (G~!ag) € GoAy (a). By Shimura [29, 
Lemma 3.29 (a)], it follows that g~!'wq € Pgalg hence 


~— 1 


q'Toalgg =Teq-'aglg =TGalg 


so that if Pcalg = LJ; Pca; is a disjoint union, then 
Teolg =q 'Tealgg =! (LI Fos q=| Ja 'Teug =| |Peq7'ag. 
i i i 
Therefore, for any f € M;(UG) we have 
@Qinf => flae= > flagteg = >) ON pag = EOF 
i i i 


oO 


Before we continue, we need a simple Lemma, which is a variant of [29, Lemma 
3.29]. 


Lemma 5.1.6 Let, C T2 be congruence subgroups of levels N,, N2 respectively. 
Let 


@;=(a € Mp(Z) | deta > 0, (det(a), N))=1, Ay, (Tia) =An(aT)}, i € {1,2}. 


Then the following assertions hold. 


(1) Piel = {§ e MaP2 | An, (&) € An, (1 - @)} fa € OD). 

(2) T(N))al (NM) = TN) BI (N)) if and only if T2aT2 = T2612 anda = 6B 
mod N, (fora, B such that (deta, N,) = (det 6B, N,) = 1). 

(3) T2el2 = Peal, = Tal. ifa € Bo and (deta, Nj) = 1. 

(4) Tyal; = Tjyal(N;) = TN, al; ifa € ®; fori = 1, 2. 

(5) Ifa € ®y andTyal, = |]; P10;, then V2aT2 = |]; Taj. 


Proof First, (4) is simply a restatement of [29, Lemma 3.29 (4)]. To show (3), note 
that gcd(N1, deta - N2) = No. By Shimura [29, Lemma 3.28] and [29, Lemma 3.9], 
we have 


P(N) = P(N2 - deta) - P(N1) € a! P(N2)aT (N}) 


so that a~!T(No)aTl(N2) GC a ~!P(No)al(N1). Hence T(N2)aT(N2) 
T(N2)aT (Nj) C P(No)al,. Therefore [2al(N2) C Pealy C ToaP>. But 
from (4), as a € ®2, we know that there is an equality [gal(N2) = T'2aP2. This 
shows (3). Next, to see (1) note that by Shimura [29, Lemma 3.29 (1)], we have 
Tyal) = {€ € SLo(Z)aSL2(Z) | Ay, (&) € An, (Pi - @)}. Since we also know 
Tyal) C Peal2, (1) follows. (2) is a special case of (1). Finally, leta € ), 
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and Tyaly = LJ, Pia;. Then Moat. = Toael) = LU, P20; by (3). Assume 
Moa; = [2a;. Then a; = ya; for some y € V2. By (1) 


An, (ai) € An, (P1) - AN, (@) = An, (P1) - AN, (Cj). 
Thus, there exists some 6 € I; such that a; = da; mod Nj. It follows that y = 4 


mod Nj. Since '(N1) C Ty, we have y € I’). This proves (5). oO 


We may now state the adjoints of the Hecke operators with respect to the 
Petersson inner product. 


Theorem 5.1.7 In the inner product spaces Sx((G), the diamond operators (q) and 
the Hecke operator T,, for p € det(G) a prime have adjoints 


(q)*= (gq)! and T= (op) + Tp 


where op € V'sG) is an element such that .y(op) = p+ AN (a)~?, and a € M>(Z) 
is any element with det(a) = p such that \y (a) € G. Thus the Hecke operators (q) 
and T,, forn € det(G) are normal. 


Proof The first assertion is just a restatement of Proposition 5.1.3 (1). For the second 
assertion, we note that if a € M2(Z) is such that Ay(@) € G and det(a) = p, then 
a € PG). Indeed, if v € St(G) then for any g € G we have 


(An (a@)van(@)~')gAn(a@)v'An(a@)7!) € Av (a@)vGv7'Ay(a)! CG 


so that Ay (w)vay(a)~! € NG). 
Therefore, by Lemma 5.1.6 (3), we see that Toyqoalog) = Veal sa). 
Moreover, we have 


An (a’) = det(a)Ay (a)! € TUG) 


hence Pyxqa@’ Tg) = Aa =Tyro@alaeG) = Veal na). 

Let y € Isa) be such that w’ € Tgay. Now, as y normalizes [g we see 
that [ga/Tg = Tgaylg = Pealcy and hence if fgalg = L]; leai, then 
Tgea’Tg =]; Veaiy. It then follows from Proposition 5.1.3 (2) that Ty = Ty = 
Ty = (y)Ty. Finally, we note that 


Any) € An(PG)AN(@)An(a’) © G+ pan (a)? = G + AN (Gp) 


so that y € TG - op, showing that (y) = (ap). oO 


Remark 5.1.8 The o, actually acts via a character. Indeed, if A C M(G) is such 
that A/G is abelian, and A contains the center of diagonal matrices, then op € A 
and all the irreducible representations of A/G are 1-dimensional. 


This leads to the following theorem. 
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Corollary 5.1.9 The space Sx({G) has an orthogonal basis of simultaneous 
eigenvectors for the Hecke operators {T, | n € det(G)} and the diamond operators 


{(¢q) |¢ € Voue)/ Te}. 


Proof This is a corollary of the Spectral Theorem for a family of commuting normal 
operators. oO 


Definition 5.1.10 We say that a modular form 0 4 f € S;(Ig) is an eigenform 
if for all n € det(G), f is an eigenvector of 7, and for all g ¢ [m,/TG, f is an 
eigenvector of (q). 


5.2 Degeneracy Maps 


We begin by defining degeneracy maps in general on our space of modular symbols. 


Definition 5.2.1 Let l, < Ls Ty < i be two pairs of finite index subgroups in 

SLo(Z). Let ¢; : rr /T; —> Q(g)* for i = 1,2 be characters. Let t € GL; (Q) 

be such that et Cc ieee and €2 0 Inn(t)~! I= €,. Fix a choice R; of coset 
1 


representatives for Ey - To’. Let 


ay : My(M, €1) > My(Po, €2), By: Mx (V2, €2) > Mk(P1, €1) 
oY (x) = thx BY (x) = D> 222) "ty - x. 
tyER;, 


We show that these operators are well defined. 


Lemma 5.2.2 The operators a and B,” are well defined, and moreover the 
composition a’ o BY is multiplication by Ir, ‘ ae Wa 


Proof First, note that as ae ay Cc we for any yj € Le y2 € cL one has 


yity2 = tt ly ty €E ie; 


so that Ty indeed acts on i: To show that a; is well defined, we must show that 
for each x € Mz (11, €1) and 1, € a we have 


ay (ix — €1(¥1)x) = 0 € My (Lo, €2). 


We have, by assumption 


1 -l 


ay (ix) =t yx = to ytto ls = a(t nto x = er(nay (x) 


hence the result. 
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We next verify that 6; is well defined. 
Suppose that y; <¢ [', and y € I’,, then 


yity2 = t(t7'yit)yr. 


Moreover, for any x € My (12, €2), we have 


ert" yit)yo)vityex = e1(yi)!e2(y2) 7" yityox = e1(¥1) Yi (€2(72)'ty2x). 
(5.2.3) 
But as £2(y2) |! tyox € Mz (P41, 1), (5.2.3) is simply £2(y2) | ty2x. 
Thus, replacing the representative ty2 by yt does not change the result, so that 
B,’ is independent of the choice of the representatives R;. 


Next, we must show that for any y € ins and any x € Mi (fo, é2), BY (yx) = 
£2(y) BY (x). However, fory € T’ " using the fact that £, is independent of the choice 
of representatives, and that i, acts on ri\r; by right translations, we get 


BY (yx) = So ex) Tttmyx= DS ex(ry 7!) tay 7! yx 
tyER; tyyy eR; 


=ex(y) D> &2(y2)7'tyox = 20(y) By (x). 


tyr2ERr 


To compute a’ o BY, we use that #R,; = [P, : 17!) 41: 


on (By (x)) = a |S en(ya)Ttypx 


tywER; 


= Ly £2(¥2)'Yox = > es saya 
tyoER; tyER; 
Oo 


We also note that by using the pairing (3.6.1) we can get maps on spaces of cusp 
forms. 


Definition 5.2.4 Let Tl, < re In < L, be two pairs of finite index subgroups in 

SL2(Z). Let ; : rst —> Q(g)* for i = 1,2 be characters. Let t € GL3@Q 

be such that ae Cc Ios and €2 0 Inn(t)~! I= &€,. Fix a choice R; of coset 
1 


representatives for Wa -T2’. Let 
a; : Se(T2, €2) > Se, €1), Br: Sk(1, €1) > Se (V2, €2) 


fo fla fe YS ear) 'flin- 


ty2ER; 
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5.3 Degeneracy Maps and Hecke Operators 


Next, as in the classical theory for Iwahori level, we would like to show that the 
degeneracy maps commute with the Hecke operators. Alas, this is not always the 
case as the following example shows. 


Example 5.3.1 Let ; = I(7) and let P2 = Ty5(7) be the non-split Cartan at 
level 7. Then there is a natural inclusion map any : Mo(T1) ~ Mo(P2), but ay; 
does not commute with the standard Hecke operators for these spaces. In this case 
dim S2(Mns(7)) = 1 is spanned by a single element f. We have ai(f) = f € 
S2(T(7)), but f is not an eigenform of the Hecke algebra on I'(7). Indeed, there 
are 3 such eigenforms, whose q-expansions are all defined over Q(./—3), while the 
q-expansion of f is only defined over Q(¢7)*. 


In the next proposition we describe the conditions under which the degeneracy 
maps commute with the Hecke operators. 


Proposition 5.3.2 Let Nc | Ny be positive integers. Let H C GL2(Z/NyZ) and 
G C GL2(Z/NGZ) be subgroups such that hy, (H) C G. Let p be a prime number 
such that p € det(H), and let t € M>(Z) be such that Ng - det(t) | Ny and 
t'Tyt C Tg. Let ae be the Hecke operator at p on the space My(1?) for ? € 
{G, H}. Then 


Vv Vi_wv Vv 
TG°% =O; OT; x. 


Proof Denote d := det(t), so that d - t~! € Mo(Z). Since t~'T yt C TG, we have 
alsod-t-'Tyt Cd- TG so that 


Awg-d (at!) + Ang-d(CH) * ANg-a(t)=ANng-d (ar-'Tut) C Ang-d(d -TG)=d -ANng-a(e). 
(5.3.3) 

However, we also assumed Ng -d | Ny, hence 
Ang-dH) = Ang-dAny (PC H)) = ANG-d (A). (5.3.4) 

Let 

Av, = {a € M2(Z) | det(a) = p and Ay, (a) €?}, 2 € {G, H} 
and let a € oe Then combining (5.3.3) and (5.3.4) we see that 
Ang-a(dt lat) = Ang-a(dt~")ANng-a(&ny (@))ANG-a() 


€ Ang-a(dt~')Ang-d(H)Ang-a(t) C d+ Ang-a(G). 
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In particular, reducing modulo d, we see that dt—'at € d- M(Z), hence t~'at € 

M)(Z). Therefore Ay, (t~ lat) € Av, (UG) = G. It follows that t~!at € ee 
Since p € det(#), we also get p € det(Ay,, (H)) © det(G). Therefore, we have 

by Shimura [29, Lemma 3.29] that AG =Tyaly and A =Tg-t lat -Tg. 
Let AR = LI; I’ ya; be its coset decomposition. Then 


ie : that : ae = r'THol pt = f* (U rs) t 
i 


= ie ee = | Jo'Tat a eet, 
i i 
Now, applying Lemma 5.1.6 (5) tot! yt C TG and t~! at, we obtain that 


Av =aTe.t 'aeTe=| [Text tay, (5.3.5) 
I 


Finally, we use (5.3.5) to compute 
[oa co) ay’ (x) = Lo ty =r! So aix = a,” co) TG): 
i i 


oO 
We also note that by using the pairing (3.6.1) we have the same for cusp forms. 
Corollary 5.3.6 Under the assumptions of Corollary 5.3.2, a; 0 Tp,g = Tp, 9 Qt. 


Remark 5.3.7 Note that these do not cover all degeneracy maps. Indeed, there 
might be t € M2(Z) such that Ng - det(t) { Ny, but t-'Tyt C TG and such 
that aw; commute with all the Hecke operators (though we must have det(t) | Ny). 
NO 
01 
the proof shows, for a; to commute with T,, it is enough to check that d tla pte 
d- M>(Z) for all p, which is equivalent to verifying that Ag (dt—')ag (@p)Aa(t) = 0. 
As this last expression only depends on Ay, (ap), hence only on p mod Ny, we 
just have to check commutation with ¢(N) elements. Therefore, one may include 
these degeneracy maps as well. 


As an example, consider 'g = T9(V), Ty =T'(N) andt = ( ) However, as 


This motivates the following definition. 


Definition 5.3.8 Let N be a positive integer. Let H C GL2(Z/NZ) be a subgroup. 
For each G C GL2(Z/NZ) such that H C G, we denote by Ng the level of G. Let 
Tg be a set of representatives for the orbits of 7 on the following set 


fT {t € M>(Z) | det(t) | N/Ng and 17! P yt C ro} (5.3.9) 
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Let ig be the map 


ig : (Se(TG))7o > SH) 


(fitete > Yo uC fi): 


teTg 


Let Over(#) be the set of minimal overgroups of H in GL2(Z/NZ). The subspace 
of oldforms at level H is 


SPH) = Sig ((e))"*) 


GeOver(H) 


and the subspace of newforms at level H is the orthogonal complement with respect 
to the Petersson inner product, 


SEM” = (KOH) 


We have to show that the sum in this definition is well defined. 


Lemma 5.3.10 Let N be a positive integer. Let H © G C GL2(Z/NZ) be 
subgroups. Denote by NG the level of G. Let Tg be the set given in (5.3.9). Then Tg 
is finite. 


Proof We first note that by the condition det(t) | N/No, the determinant of t¢ 
assumes finitely many values, hence it is enough to show that the subset Tg q of 
such t’s with det(t) = d is finite. 

Next, we recall that if Ag is the set of matrices in M2(Z) of determinant d, then 
SL2(Z)\ Aq is finite. Since 'y is of finite index in SL2(Z), the result follows. oO 


The Hecke operators respect the decomposition of S$, (IG) into old and new. 


Proposition 5.3.11 The subspaces S;(UG)°" and S(T)" are stable under the 
Hecke operators T, forn € det(G) and the diamond operators (op), for p € det(G) 
prime. 


Proof By Corollary 5.3.6, the Hecke operators T, for p € det(G) commute with 
the degeneracy maps a. By multiplicativity, it extends to JT, such that n € det(G). 

For the diamond operators, let G; GC GL2(Z/N;Z) for i = 1,2 be such that 
Anz(Gi) © Go, and let t € M>(Z) be such that det(t) | Ny /N2 and t'Pyt CM, 
where I’; is the congruence subgroup induced by G;. Let d = det(t). Then dt~! € 
M2(Z) and so if a is such that Av, (a@) € G; and det(~) = p, we have 


And dt~ opt) = Anya (dt~')ANy-d(D-ANy-d ()7)-ANy-a(t) € P*ANy-a (dt~!)ANy.a(G1)ANp-a(t)- 


However, since t~!T yt € Mo, we also have 
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Anya (dt~")ANy-a(Gi)Any-a(t) Cd -Any.a(V2) 


hence tlopt € M>(Z) and moreover Aw, (tl opt) € p-dn,(T2) = p- G2 © 
N(Gz2). It follows that ee: € T'9xG,), and so for any f € S,(I"2) we have 


(op) 004) (f) = Flitay = Fletaptet = (ar 0 topt)) (f). 


Finally, we note that A, (t~lopt) = p-hny (t~!at)~*, and that by the proof of 
Corollary 5.3.2, t~!at € M>(Z) is an element such that AN» (t~!at) € Go. It follows 
that (ent) is the diamond operator at p for G2. This concludes the proof. oO 


6 Zeta Functions and q-Expansions 


Using the results of Sects.4 and 5, we may now compute the zeta functions 
associated to eigenforms in S; (MG). 

All the ideas and methods used in this section are the same as in the case of 
level ')(N) or T'9(N), and are known (see [33], [32]) and used to compute the 
q-expansions. 

The process for doing so is divided into three steps: 


(1) Compute a subspace of M;(I°)” that is isomorphic to S,(I°) as a Hecke module, 
denoted by S,(T)Y. 

(2) Decompose the space S;(I°)Y into irreducible Hecke modules. 

(3) For each irreducible Hecke module in the decomposition, find the system of 
Hecke eigenvalues, hence the zeta function. 


6.1 Constructing Dual Vector Spaces 


Since our interest lies in S;(I"), we would like to have a vector space which is 
isomorphic to S(T) ® S;() as a Hecke module. We could do it by using the 
pairing (3.6.1), but that would involve computation of integrals and approximation 
if done directly. 

Instead, we can use the fact that S,([) and S,(f) are isomorphic as Hecke 
modules, thus leading to the following algorithm. 


Algorithm 6.1.1 DualVectorSpace(M). Compute the dual vector space. 


Input : M C M,(T) a subspace. 

Output : MY C M;(T)Y which is isomorphic to M as a Hecke module. 
(1) V:= MAT)’, p= 2 

(2) While (dim V > dim M) do 
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(a) Ty := HeckeOperator(V, p). 

(b) Tp,m := HeckeOperator(M; (1), p)|m. 
(Cc) Cp := CharPoly(Tp,). 

(d) V := ker(cp(Tp)). 

(e) p := NextPrime(p). 

(3) Return V. 


Here, we assume the existence of a function CharPoly, that given an operator on 
a vector space, computes its characteristic polynomial. 


Remark 6.1.2 We have not defined the Hecke operators T,, for (n, N) > 1, and 
these cannot be ignored (see [7, Example 12.2.11]). It is possible to circumvent this 
problem, as is done in [7, Section 8.10]. Practically when p | N, we use instead of 
T, the double coset Hecke operators T, with det(@) = p. There are finitely many 
such operators, and the operator T;, is a linear combination of them. Thus, for the 
purpose of describing the dual vector space, and later on, decomposition of the space 
to irreducible Hecke modules, these suffice. 


6.2 Decomposition of S;(T) 


The following theorem is a generalization of [32, Theorem 9.23]. It is an application 
of Sturm’s Theorem, which will enable us to decompose the Hecke module S; (I) 
in a finite (effectively bounded) number of steps. 


Theorem 6.2.1 Suppose I is a congruence subgroup of level N, and let 
r = Sturm(k, P) := ka = “| (6.2.2) 


where m = [SL2(Z) : 1]. Then the Hecke algebra 
T=Z[...,Tn,...] © End(S;(T)) 


is generated as a Z-module by the Hecke operators T, forn <r. 
Proof Same as in [32, Theorem 9.23]. oO 


This allows us to decompose the space S,(I°)Y = Sx (1) ® S$; (I) into irreducible 
Hecke modules, as follows. 


Algorithm 6.2.3 Decomposition(M, p). Decomposition into irreducible Hecke 
modules. 
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Input : 


¢ M CMk(L) -a vector subspace stable under the Hecke action. 
* p-aprime. 


Output : D, a list of subspaces of M, such that M = @viep Va and Vy’ is an 
irreducible Hecke module. 


(1) If p | N, replace by a larger prime such that p { N. 
(2) Tp := HeckeOperator(M™”, p), D := 9. 

(3) f := CharPoly(T,). Write f = jean cae 

(4) fori € {1,2,...,J}do 


(a) V :=ker(fi(Tp)™) © MY,W:=VY CM. 
(b) if IsIrreducible(W) 


ai) D:= DU{W} 
else 


(i) if W = M then q := NextPrime(p) else g := 2. 
(ii) D := DU Decompose(W, q). 


(5) Return D. 


Here Decompose is a recursive call to the function, and IsIrreducible is creating 
a random linear combination of Hecke operators, and checks that its characteristic 
polynomial (on the plus subspace) is irreducible. One could eliminate the random- 
ness to provably verify the irreducibility, as in [32, Chapter 7]. 

Calling Algorithm 6.2.3 with M = Sx(T), p = 2 we obtain the decomposition 
we wanted. 

Note that the algorithm terminates if there are no eigenvectors with multiplicities, 
that is there are no oldforms arising as images of forms of lower levels. In that case, 
the number of steps in the algorithm can be bounded using Theorem 6.2.1. 

In order to deal with the images of oldforms, we may begin by identifying the 
new subspace, and continuing recursively. The new subspace is computed by noting 
that it is dual to S,(( Gg)" := Me'eover(G),teT yr ker(a’). 

Also, in order to get the old subspace one need not compute the Petersson inner 
product, but instead note that it is generated by the images of the BY. 


Id. Vv 
S(T)?" = (Im(6; ))GreOver(G),teT es : 


6.3 Computing the Zeta Functions 


Finally, for a subspace V C S,(MG) such that VY is an irreducible Hecke module, 
we can compute the zeta function of an eigenform (and hence of all eigenforms in 
VY). 
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Algorithm 6.3.1 ZetaFunction(V, L). Compute the zeta function associated to an 
eigenform. 


Input : 


¢ V CS; (TG) such that VY is an irreducible Hecke module. 
¢« L-a positive integer. 


Output : Z(f,s) = )°-2, ann™ such that 7, f = anf for all n (set a; = 1), 
given to precision L~*. 


(1) Find (randomly) a linear combination of Hecke operators which is irre- 
ducible on (VY)*, denote it by T. 

(2) Let v be an eigenvector of T over the field Q[x]/fr(x), where fr = 
CharPoly(T). 

(3) Write v = a cya; where e; are the basis vectors of Mx (I°) given by 
Manin symbols, and c; 4 0. 

(4) For primes p < L, set dp := (Tp (ei), v). Complete the other a,’s using 
multiplicativity. 

(5) Return )47/ ayn7* + O(L~). 


We start by showing that the algorithm returns the correct output, and measure 
its complexity. 


Lemma 6.3.2 Let G C GL2(Z/NZ) be a group of real type of index Ig with 
surjective determinant such that T, is effectively computable for all p | N. There 
exists an algorithm that given a subspace V of Sx(Ug) of dimension d, such that VY 
is irreducible as a module for the Hecke algebra, and an integer L, returns the zeta 
function associated to an eigenform up to precision L~* in O(C log N(L log L + 
N)+ NIZ -In+d?3), where C is the cost of a basic CosetIndex operation and In is 
the cost of the group membership test for G. 


Proof We apply Algorithm 6.3.1. Since G is of real type, by Corollary 4.1.9, the 
Hecke operator commutes with the star involution, so that step (1) makes sense. For 
any p < L we have 


In particular, 


d 
(Tale); v) = (ci Ty») = (« Seneie] = ahs 
j=l 


For computation of the Hecke operators away from the level, we apply Corol- 
lary 4.5.11, while for the Hecke operators at primes p | N, we apply Theo- 
rem 4.2.10. Note that as the 7, are effectively computable for all p | N, we 
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may do so with no additional cost to our complexity. Therefore, the complexity 
of computing the Hecke operators is given by 


Cc plogp+C ss p log(N*p) + > 72 «In 


p<L,p{N p<L,p|N p<L,p|N 
=C) > plogp+4ClogN S > p+NIG-In 
p<L p<L,p|N 
= O(C(Llog L + N) log N + NIZ -In). (6.3.3) 
Finally, the d? contribution comes from the linear algebra operations. Oo 


We can now pack everything together to produce the following corollary. 


Corollary 6.3.4 There exists an algorithm that given a group of real type G C 
GL2(Z/NZ) with surjective determinant such that for all p | N the Hecke operator 
is effectively computable and a positive integer L, returns the zeta functions of the 
factors of Jac(XG) using 


O(d(C log N(LlogL + N) + NIZ - In+Ig log Ig) + d?) 


field operations, where d := dim S)((G), Ig := [SL2(Z) : TG], C is the cost of a 
CosetIndex operation, and In is the cost of membership testing in G. 


Proof We first compute a basis for S2([) using Theorem 3.5.1. Then we can 
apply Algorithm 6.2.3 to decompose it into irreducible subspaces. Here, we have to 
compute the matrices of the Hecke operators up to the Sturm bound (6.2.2), which 
is linear in Ig. 

For computation of the Hecke operators away from the level, we apply Corol- 
lary 4.5.11, while for the Hecke operators at primes p | N, we apply Theo- 
rem 4.2.10, as in (6.3.3). Since we now compute their action on a basis, we have to 
multiply by d, and compute them up to Jg to obtain O(d(C Ug log Ig + N) log N+ 


Nig -In)). 
Finally, for each of the irreducible subspaces, we should apply Algorithm 6.3.1, 
which by Lemma 6.3.2 costs O(C log N(L log L + N) + NI, - In+d?). Oo 


7 Applications 


In this section, we present a few of the applications that the above result contributed 
to, and also some future applications. All time measurements were taken on a 
MacBook with 2.3GHz 8-Core Intel Core i9 processor, and 16 GB 2400 MHz 
DDR4 memory. 
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7.1 Classification of 2-Adic Images of Galois Representations 
Associated to Elliptic Curves over Q 


In [26], the authors compute the maximal tower of 2-power level modular curves 
containing a non-cuspidal non-CM rational point, and develop special techniques 
to compute their equations. Using our implementation, one can obtain the q- 
expansions of a basis for S2(I°) for each of those groups. 

For example, in [26, Example 6.1] the authors present a curious example 
associated with the group H1s55, which is of index 24 and level 16, whose image 
in GL2(Z/16Z) is generated by 


(3):(23)-(i23)-(ta9) 


Then a quick calculation of 1.78s with our implementation yields the following. 


> tt := Cputime() ; 

> gens := [[1,3,12,3],[1,1,12,7],[1,3,0,3],[1,0,2,31]]; 
SN 2s] 16; 

> HN := sub<GL(2,Integers(N)) | gens>; 

> H := PSL2Subgroup (H_N) ; 

> M := ModularSymbols(H, 2, Rationals(), 0); 

> S := CuspidalSubspace (M) ; 

> D := Decomposition(S, HeckeBound(S) ) ; 

> qEigenform(D[1],100); 

q - 4*q*5 - 3*q*9 - 4*q*13 - 2*q*17 + 11l*q*25 - 

4xq°29 + 

12*q*37 - 10*q*41 + 12%*q*45 - 7*q*49 - 4«q*53 + 
12*q*61 + 

16*qQ*65 - 6*q*73 + 9*q*81 + 8xq*85 + 10*q*89 - 18%q*97 
+ O(q*100) 

> Cputime (tt); 

1.780 


Proceeding to compute the invariants of the elliptic curve associated to this 
eigenform, we see that the modular curve Xy is isogenous to the elliptic curve 
labelled 256b1, defined by the equation y* = x* — 2x over Q, for which X 7(Q) = 
Z/2Z x Z is generated by (0, 0) and (—1, —1). 


Remark 7.1.1 For this application, if one wants to compute the equation for the 
elliptic curve up to isomorphism, one has to compute the j-map. This can be done 
by computing the g-expansion of Eisenstein modular forms. A way to do so (in 
an even more complicated case) is explained in detail in [26, Example 6.1]. Our 
methods simply replicate the efforts originally done by the authors. For convenience, 
we have also supplied in our code package the code to generate Eisenstein series, 
e.g. for the weight 2 Eisenstein series, we can do the following. 
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> M := ModularForms (H) ; 

> gExpansion(EisensteinSeries(M) [1], 100); 

q°16 + 32*q*32 + 244*q*48 + 1024«q*64 + 3126*q*80 
+ 7808*q*96 + O(q*100) 


7.2 Modular Curves of Prime-Power Level with Infinitely 
Many Rational Points 


In [34, Corollary 1.6], the authors introduce a finite set of subgroups G of GL2(Z)) 
that arise as the image of a Galois representation for infinitely many elliptic curves 
over Q with distinct j-invariants. 

In section 6, they consider the 250 cases where G has genus | and show that the 
Jacobian of XG, Jc, for each such group is isogenous to one of a certain finite set 
of elliptic curves. Then they compute point counts of the reduction modulo several 
primes to pinpoint the correct isogeny class over Q, and show that one needs only 
consider a certain set of 28 groups. 

These are included in the examples computed in [26], and again, using our 
implementation, it is possible to construct for each of these subgroups G a basis 
of g-eigenforms for the modular curve XG. 

The following example took 2.120s. 


> tt := Cputime() ; 

> gens := [[2,1,3,2],[0,3,5,8],[1,0,0,5], [1,8,0,3]]; 
> N := 16; 

> H_N := sub<GL(2,Integers(N)) | gens>; 

> H := PSL2Subgroup (H_N) ; 

> M := ModularSymbols(H, 2, Rationals(), 0); 

> S := CuspidalSubspace (M) ; 

> D := Decomposition(S, HeckeBound(S) ) ; 

> gqHEigenform(D[1],100) ; 

q - 2*q*3 + Q*9 - 6xq*11 - 6*q*17 - 2*q*19 - 5x*q*25 


+ 4«q*27 
+ 12*q*33 + 6*q*41 + 10*q*43 - 7*q*49 + 12*q*51 
+ 4*q*57 
- 6*Qg*59 + 14*q*67 - 2*q*73 + 10*q*75 - 11l«*«q*81 
- 18«q*83 
- 18%*q*89 + 10*q*97 - 6*q*99 + O(q%*100) 
> Cputime (tt) ; 
2.120 


This is the newform 256.2.2.a. Computing the elliptic invariants, we see that this 
is isogenous to the curve 256a2 defined by y? = x3 + x? — 13x — 21. 
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7.3 Efficient Computation of q-Eigenforms for Xns(N) and 
Xns(N) 


Using the Merel pair introduced in Example 4.5.6, and a similar one for [7 (p), 
its normalizer, one obtains a highly efficient implementation of the Hecke operators 
T,, for all n. This allows one to compute the g-eigenform and verify the results of 
[15] for Xns(11), [6] for X*,(13), and [21] for X17), X*,(19) and X7 (23), as 
follows. 


tt := Cputime(); 
G := GammaNSplus (13); 
M := ModularSymbols(G, 2, Rationals(), 0); 


S CuspidalSubspace (M) ; 

D := Decomposition(S, HeckeBound(S) ) ; 
gEigenform(D[1], 20); 

+ axq*2 + (-a*2 - 2*a)*qg*3 + (a*2 - 2)*q*4 4+ 
a*2 + 2xa - 2)*q*5 + (-a - 1)*Q*6 + (a*2 - 3)«*q*7 + 
-2xa*2 - 3*a + 1)*Q*8 + (a*2 + 3%a - 1)*Q*9 + (-a 
1)*q*10 + (-a*2 - 2*ea - 2)*q*11 + (a*2 + 3%a)*q*12 + 
-2xa*2 - 2ea + 1)*q*14 + (a*2 + a - 2)*qQ*15 +4 

-a*2 - a + 2)*q*16 + (-a*2 + a + 2) *qQ*17 + 

(a*2 + 1) *q*18 + (-2*a*2 - a + 2)*Q*19 + O(q*20) 

> BaseRing (Parent ($1) ) ; 
Number Field with defining polynomial x*3 + 2*x*2 - x - 1 
over the Rational Field 

> Cputime (tt) ; 

0.160 


ll 
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This coincides with the result in section 4 in [6], and took merely 0.16s. 
Similarly, in only 0.33 s we get the result for X,*. (17). 


ns 


tt := Cputime(); 

G := GammaNSplus (17) ; 

M := ModularSymbols(G, 2, Rationals(), 0); 
S := CuspidalSubspace (M) ; 

D := Decomposition(S, HeckeBound(S) ) ; 


[xqEigenform(d,20) : d in Dx]; 
* 

- qe2 - q*4 + 2*qg*5 - 4%q*7 + 3xq*8 - 3%q*9 - 
2*q*10 - 
2*q*13 + 4%q*14 - q*16 + 3*qg*18 - 4*q*19 + O(q*20), 
q + axq*2 + (-a - 1)*q*3 + (-a + 1) *q*4 + axg*5 - 
3xq°6 + 
(-a - 2)*q*7 - 3*q*8 + (a + 1)*Q*9 + (-a + 3)*q*10 - 
3*q*11 + 


WMOnvvvVvV Vv 
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a + 2)*q*12 + (a - 1)*q*13 + (-a - 3)*q*14 - 3*q*15 + 
-a - 2)*q*16 + 3*qg*18 + (-3%a - 1)*q*19 + O(q*20), 

+ axq*2 + (-a*2 + 1)*q*3 + (a*2 - 2) *qQ*4 + 
a - 2)*q*5 + (-2*a + 1)*q*6 + (a*2 - 2)*qQ*7 + 


(a*2 - a - 2)*q*9 + (-a*2 - 2%a)*q*10 + 

(2xa*2 + 2xa - 6)*qQ*11 + 

(a - 2)*q*12 + (-2*a*2 - 3%a + 6)*q*13 + (a - 1)*q*14 + 
(2*a*2 + 2ea - 3)*Q*15 + (-3*a*2 - a + 4) *q*16 + 

(-a*2 + a -1)*q*18 + axg*19 + O(q*20 
*] 
> BaseRing (Parent (qEigenform(D[2]))); 
Number Field with defining polynomial x*2 + x - 3 
over the Rational Field 
> BaseRing (Parent (qEigenform(D[3]))); 
Number Field with defining polynomial x*3 - 3*«x + 1 
over the Rational Field 
> Cputime (tt) ; 
0.330 


which coincides with the results of [21]. 

In [5], section 6, there is a description of coset representatives for [,;(V) and 
[+ (N) for general N, which again makes explicit the implementation via modular 
symbols. 


7.4 Decomposition of the Jacobian of X*,(p) 


Applying Algorithm 6.2.3 to the space So(I*,(p)), where *,(p) is yields a 
decomposition of the Jacobian of X;‘,(p). This has been computed using our code 


for all p < 97 (see [1]). 


7.5 Computation of q-Eigenforms for X G(p) When G Is 
Exceptional 


Similarly, the code can be used to compute q-eigenforms for the modular curves 
XG(N), when G is an exceptional subgroup of GL2(F,) with surjective determi- 
nant, e.g. running it for Xs5,(13) we recover [4, Theorem 1.8]. We obtain step 3 
of section 4 in the original paper in just a few seconds. (We use the embedding of 
S4 <> GL2(F >) using the quaternions as described in [36, Section 11.5].) 


> tt := Cputime(); 
Sip .= 13% 
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a*2 + 2xa - 2)*q*5 + (-a - 1)*Q*6 + (a*2 - 3)*q*7 + 
-2*a*2 - 3%*a + 1)*q*8 + (a*2 + 3%a - 1) *Q*9 4+ 

-a + 1)*q*10 + (-a*2 - 2*a - 2)*q*11 + 

(a*2 + 3xa)xqg*12 + 

(-2*a*2 - 2ea + 1)*qQ*14 + (a*2 + a - 2) *q*15 + 

(-a*2 - a + 2)*q*16 + (-a*2 + a + 2)%q*17 + 

(a*2 + 1) *q*18 + (-2*a*2 - a + 2)*Q*19 + O(q*20) 


> B<i,j,k> := QuaternionAlgebra(Rationals(), -1, -1); 
> O := QuaternionOrder([1,i,j,k]); 

> _, mp := pMatrixRing(O,p) ; 

> S4tp := sub<GhL(2,p) | [mp(1+s) : s in [i,j,k]] 

> cat [mp(l1-s) : s in [i,j,k]] cat [-1]>; 

> H S4 := sub<GhL(2,Integers(p)) | Generators (S4tp) >; 
> G S4 := PSL2Subgroup(H_ S4) ; 

> M := ModularSymbols(G_S4, 2, Rationals(), 0); 

> S := CuspidalSubspace (M) ; 

> D := Decomposition(S, HeckeBound (Ss) ) ; 

> [*qEigenform(d,20) : d in Dx]; 

[x 

q + axq*2 + (-a*2 - 2«a)xq*3 + (a*2 - 2)*qQ*4 4+ 

( 

( 

( 


> BaseRing (Parent (qEigenform(D[1]))); 

Number Field with defining polynomial x%*3 
+ 2*x*2 - x - 1 

over the Rational Field 


7.6 Smooth Plane Models for Modular Curves 


Modular curves of genus 0, | obviously admit a smooth plane model always. In 
general, if a curve admits such a model of degree d, then its genus would be g = 
(d=)(4=?) | so the next numbers to check are g = 3,6. When g = 3, by Gonzalez- 
Jiménez and Oyono [16], the generic case is a smooth plane quartic, so the answer 
is “almost always”. 

However, for g > 3, a generic curve does not admit a smooth plane model. 

The smallest case for which we do not know the answer is g = 6. Thus, 
computing a basis of eigenforms of S2([°) for each of the congruence subgroups 
I’ of genus 6 (see [13]), will help us to get a canonical model and from it (maybe) 
decide whether there exists a plane model. 


Computing Classical Modular Forms for Arbitrary Congruence Subgroups 103 


References 


1. 


Eran Assaf. Dimensions of the decomposition of the Jacobians of X;1,(p) for primes p < 
100 and characteristic polynomials of the Hecke operators T,,. https://github.com/assaferan/ 
ModularSymbols/blob/master/dec_dims_char_polys_ns_cartan.dat. Last accessed: 2020-02- 
03. 


. Eran Assaf. Modular symbols package for arbitrary congruence subgroups. https://github.com/ 


assaferan/ModularSymbols/. Last accessed: 2020-02-03. 


. A Oliver L Atkin et al. Twists of newforms and pseudo-eigenvalues of W-operators. Inventiones 


mathematicae, 48(3):221—243, 1978. 


. Barinder Banwait and John Cremona. Tetrahedral elliptic curves and the local-global principle 


for isogenies. Algebra & Number Theory, 8(5):1201—1229, 2014. 


. Burcu Baran. Normalizers of non-split Cartan subgroups, modular curves, and the class number 


one problem. Journal of Number Theory, 130(12):2753—2772, 2010. 


. Burcu Baran. An exceptional isomorphism between modular curves of level 13. Journal of 


Number Theory, 145:273-300, 2014. 


. Alex J. Best, Jonathan Bober, Andrew R. Booker, Edgar Costa, John E. Cremona, Maarten 


Derickx, Min Lee, David Lowry-Duda, David Roe, Andrew V. Sutherland, and John Voight. 
Computing classical modular forms. In Arithmetic Geometry, Number Theory, and Computa- 
tion, pages 131-214. Springer, Cham, 2021. https://doi.org/10.1007/978-3-030-80914-0_4 


. Wieb Bosma, John Cannon, and Catherine Playoust. The Magma algebra system I: The user 


language. Journal of Symbolic Computation, 24(3-4):235-265, 1997. 


. Josha Box. Computing models for quotients of modular curves. Res. Number Theory 7(3):51, 


2021. 


. Kevin Buzzard. Computing weight one modular forms over C and Op. In Computations with 


modular forms, pages 129-146. Springer, 2014. 


. Jean-Marc Couveignes and Bas Edixhoven. Computational aspects of modular forms and 


Galois representations. Princeton University Press, 2011. 


. John E Cremona. Algorithms for modular elliptic curves. 2nd edition. Cambridge University 


Press, Cambridge, 1997. 


. Chris Cummins and Sebastian Pauli. Congruence subgroups of PSL2(Z). https://mathstats. 


uncg.edu/sites/pauli/congruence/csg6.html. Last accessed: 2020-02-03. 


. Fred Diamond and Jerry Michael Shurman. A first course in modular forms, volume 228. 


Springer, 2005. 


. Valerio Dose, Julio Fernandez, Josep Gonzalez, and René Schoof. The automorphism group 


of the non-split Cartan modular curve of level 11. Journal of Algebra, 417:95-102, 2014. 


. Enrique Gonzalez-Jiménez and Roger Oyono. Non-hyperelliptic modular curves of genus 3. 


Journal of Number Theory, 130(4):862-878, 2010. 


. Lloyd JP Kilford. Modular forms: a classical and computational introduction. Mathematical 


intelligencer, 32(3):58-59, 2010. 


. Ravi S Kulkarni. An arithmetic-geometric method in the study of the subgroups of the modular 


group. American Journal of mathematics, 113(6):1053-1133, 1991. 


. Jul Manin. Parabolic points and zeta-functions of modular curves. Mathematics of the USSR- 


Izvestiya, 6(1):19, 1972. 


. Barry Mazur. Rational points on modular curves. In Modular functions of one variable V, 


pages 107-148. Springer, 1977. 


. Pietro Mercuri and René Schoof. Modular forms invariant under non-split Cartan subgroups. 


Mathematics of Computation, 89(324):1969-1991, 2020. 


. Loic Merel. Opérateurs de Hecke pour I'9(N) et fractions continues. In Annales de l’institut 


Fourier, volume 41, pages 519-537, 1991. 


. Loic Merel. Universal Fourier expansions of modular forms. In On Artin’s Conjecture for Odd 


2-Dimensional Representations, pages 59-94. Springer, 1994. 


104 E. Assaf 


24. James S Milne. Introduction to Shimura varieties. Harmonic analysis, the trace formula, and 
Shimura varieties, 4:265—378, 2005. 

25. Andrew Ogg. Modular forms and Dirichlet series. WA Benjamin New York, 1969. 

26. Jeremy Rouse and David Zureick-Brown. Elliptic curves over Q and 2-adic images of galois. 
Research in Number Theory, 1(1):12, 2015. 

27. George J Schaeffer. Hecke stability and weight 1 modular forms. Mathematische Zeitschrift, 
281(1-2):159-191, 2015. 

28. Jean-Pierre Serre. Propriétés galoisiennes des points d’ordre fini des courbes elliptiques. 
Invent. math, 15:259-331, 1972. 

29. Goro Shimura. Introduction to the arithmetic theory of automorphic functions, volume 1. 
Princeton University Press, 1971. 

30. Vyacheslav Vladimirovich Shokurov. The study of the homology of Kuga varieties. Izvestiya 
Rossiiskoi Akademii Nauk. Seriya Matematicheskaya, 44(2):443-464, 1980. 

31. William A Stein. An introduction to computing modular forms using modular symbols. In An 
MSRI Proceedings, 2002. 

32. William A Stein. Modular forms, a computational approach, volume 79. American 
Mathematical Soc., 2007. 

33. William Arthur Stein. Explicit approaches to modular abelian varieties. PhD thesis, University 
of California, Berkeley, 2000. 

34. Andrew Sutherland and David Zywina. Modular curves of prime-power level with infinitely 
many rational points. Algebra & Number Theory, 11(5):1199-1229, 2017. 

35. John A Todd and Harold SM Coxeter. A practical method for enumerating cosets of a finite 
abstract group. Proceedings of the Edinburgh Mathematical Society, 5(1):26-34, 1936. 

36. John Voight. Quaternion Algebras. Graduate Texts in Mathematics, 288. Springer, 2021. 

37. Gabor Wiese et al. Modular forms of weight one over finite fields. PhD thesis, Stieltjes Institute 
for Mathematics, Faculty of Mathematics and Natural Sciences, 2005. 

38. David Zywina. Computing actions on cusp forms. arXiv preprint arXiv:2001.07270, 2020. 


Square Root Time Coleman Integration M®) 
on Superelliptic Curves sei 


Alex J. Best 


Abstract Since Kedlaya first introduced a p-adic algorithm for computing zeta 
functions of hyperelliptic curves, many related algorithms for computing both 
zeta functions and Coleman integrals on various classes of algebraic curves have 
been studied. These algorithms compute in the Monsky-Washnitzer cohomology 
or the rigid cohomology of the curve to determine the action of Frobenius on this 
cohomology. 

We give a new algorithm for explicitly computing Coleman integrals on superel- 
liptic curves over unramified extensions of p-adic fields. The runtime is softly linear 
with respect to the square root of the size of the residue field, bringing the runtime 
in line with that of the corresponding zeta function algorithms. We also describe the 
implementation of this algorithm in Nemo, a new package for the Julia programming 
language, which adds functionality for computational number theory. We compare 
Nemo with other systems in use in this area. 


1 Introduction 


In [18], Kedlaya introduced an algorithm using p-adic cohomology to compute the 
zeta function of a hyperelliptic curve defined over a finite field. Since then, there 
have been advances and generalisations of this method in (at least) three different 
directions. Firstly [13, 14] and [22, 23] have introduced variants that work on more 
general curves, respectively; superelliptic curves, cyclic covers of P! and general 
curves with a map to P, Secondly, Harvey [16] introduced a variant that runs in time 
quasilinear in ./p. Finally, Balakrishnan-Bradshaw-Kedlaya [1] gave an algorithm 
to compute p-adic Coleman integrals on curves over the p-adics. A large part of the 
runtime of this algorithm is taken up by computing explicit relations in Monsky- 
Washnitzer cohomology, the same core procedure used in Kedlaya’s algorithm. 
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In [20], Minzlaff showed that the first and second of these could be combined; 
that is, there exists an algorithm that computes the zeta function of superelliptic 
curves over F,» in O (p'/*) time. Additionally the paper [7] gives an algorithm 
based on the work of Balakrishnan-Bradshaw-Kedlaya and of Harvey that computes 
Coleman integrals on hyperelliptic curves over Q, in time quasilinear in ./p. 

In this paper, we incorporate the work of Minzlaff into that of [7]. Let a, b be 
coprime integers with a > 1, b > 2, let p be a prime and g = p", take h € Z,[x] a 
squarefree polynomial of degree b, and consider the curve 


C/Zq: y4 = h(x). 


Let g = (a — 1)(b — 1)/2 be the genus of C. Let M be the matrix of g-power 
Frobenius acting on Hig(C Xz, Qq) (via comparison with Hi. (C xz, Fa, Qq)), 
in terms of the basis B = {@;,; = de Wie, b-9 feta and N € Ns be such 
that both C and points P, Q € C(Qp) are known to precision i. and assume 
p > (aN — 1)b. Then, if multiplying two m x m matrices requires O(m®) ring 
operations we have: 


Theorem 1.1 The vector of Coleman integrals ( i Es ®)wep can be computed in time 
O (s°/pnn*? + N4¢4n? log P) 


to absolute p-adic precision N — vp(det(M — 1)). 


The work of Balakrishnan-Tuitman [4] has shown that there exists an algorithm 
to compute Coleman integrals on general curves defined over p-adic fields. This 
article shows that fast Coleman integration algorithms are not limited just to the 
hyperelliptic case. 

We also take the opportunity to work out the practical details of the theory 
without restricting to the case of monic h(x) or curves only defined over Q,; instead 
we work over arbitrary unramified extensions of Q, throughout. 

The algorithms described in this paper have been implemented using the Nemo 
computer algebra system [12], which is a specialised package for number theory 
and related mathematics for the programming language Julia [8]. This is a relatively 
new system for computing in commutative algebra, number theory and group theory 
that is based on several low-level libraries such as MPIR, Flint, Arb and Antic. 
A secondary goal of this paper is therefore to compare the performance of the 
various computer algebra systems available to mathematicians for running this type 
of computation. The implementation also allows for h to be non-monic and the curve 
C to be defined over an unramified extension of Qp. 
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2 Set-Up and Notation 


The major outline of the algorithm presented in this paper follows that of [7], 
adapted to the case of a (tamely) superelliptic curve: these will be smooth 
projective curves C of a specific form, which will be defined over the ring of integers 
of an unramified extension of Q, (such extensions will be denoted as Q,, where 
q = p” is the cardinality of the residue field). The present work can be read in 
conjunction with [7]. 

We will denote by o the Witt vector Frobenius on Zg and Q,. The restriction of 
being superelliptic for us means that the curve should be given by an affine equation 
of the form 


y* = h(x), deg(h) = b, h squarefree 


where gcd(a,b) = | anda > 1,b > 2. Tameness means that, in addition, we 
assume p { a, though this will be implied by our later assumption (2.1). We will 
write h = Ax? + h with degh < b. We can view the projective curve C as living 
inside of the weighted projective space P(a, b, 1), where there is a unique point at 
infinity on the curve, denoted oo. As h might not be monic, this will be of the form 
(A' : AS : 0) for some s,t € Z such that as — bt = 1. A superelliptic curve of this 
form will have genus 


_ hea YO=—D 
= i ; 
Each set of points DC C (Q,) that is the full preimage of some x € CR, (Fy) 


under the reduction map is known as a residue disk. 
Such a superelliptic curve comes equipped with a natural map to P! given by 


(x: yiz)hR (x: 2%). 
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We will often be interested in the points where this map is branched (i.e. those with 
y = 0), and call them the branch points of this curve. We also call any residue disk 
containing one of these points a branch disk. 

These play the analogous role to that of the Weierstrass points in the hyperelliptic 
case, but we prefer to avoid calling them Weierstrass points as the notion of a 
Weierstrass point in algebraic geometry would include non-branch points in general, 
see [21, p. 3372]. 

We will often work with the affine open subset of the curve obtained by removing 
all branch points (including oo), writing 


A =Zy[x, y, y'W/(y" — h@)). 
This affine space is then 
SpeccA=UCC. 


We denote by A‘ the weak completion of A; this is the ring of formal power series 


[o.@) 
, R; Vy (Rj 
Abs i) : Rj € Zg[x]deg<b—1, liminf ph i) >O}. 
oe ~ fico [i 


We will consider the module of 1-forms 
Q), = Atdx @ Atdy/(ay*!dy — h’(x)dx)), 
and the exterior derivative map is denoted 
d: AT > Qh. 


We also use Ajo, to denote the Q,-algebra of Q, -valued functions on C (Q,) that 
are given by a convergent power series on each residue disk and are Gal(Q,/ Q,)- 
equivariant. Note that AT <> Ajoc. The Monsky-Washnitzer cohomology of A is 
then defined to be Hyiw(A) = & a /d(A‘). 

From now on, we make the assumption that 


p > (aN — Ib; (2.1) 


this simplifies the analysis of denominators appearing later in the algorithm. 
This assumption is likely completely removable without affecting the asymptotic 
complexity, or at least smaller p may be used by doing a more involved precision 
analysis to ensure the right amount of extra precision is used. 
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3 Coleman Integration 


Coleman integration is an integration theory that is in particular defined for 1-forms 
on curves, viewed as p-adic analytic spaces. This theory has been applied to various 
questions in the arithmetic of curves over number fields and p-adic fields. These 
applications include provably determining the set of rational points on a curve using 
Chabauty-Coleman(-Kim) [19, 3] and determining torsion points on curves. For 
background on Coleman integration, especially the details of the action of Frobenius 
when n > 1, we refer to [5]. 

Coleman integration for curves is computable in many cases [6, 1, 4]; that is, 
given a base point and a 1-form to some finite p-adic precision, the corresponding 
integral can be computed to some (smaller) precision. This can be done in many 
instances of interest to arithmetic geometers. Moreover these algorithms perform 
well in practice, returning an answer in examples of interest to mathematicians in 
an acceptable real-world runtime. 

Coleman integration requires the choice of a lift of Frobenius, that is a map 


¢: A‘ > At 
such that the reduction of this map to Fy 
a. at t 
dp: Ag, > Ag, 


is the p-power Frobenius map and $(k) = o(k) for all k € Z,. There are many 
possibilities for such a lift in general; in this paper, we will make an explicit choice 
of a lift for superelliptic curves that allows us to analyse its action in detail. It is 
therefore important to note that the theory is known to be independent of the choice 


of @. 
We define the action of ¢ on C (Q,) via 


(x0, yo) = (0! (@(x)(x0, yo), 7 | (H(¥) (x0, Yo)))- (3.1) 


On functions f: C (Q,) > Q, the action of @ is then 


b(f)(P) = of @(P)). (3.2) 
The key theorem proved by Coleman describing this integration theory can be 
stated in our setting as follows. 


Theorem 3.1 (Coleman) There is a unique (up to a global constant of integration) 
Q, -linear integration map ia Qh @ Qy = Aloc satisfying the following: 


(1) Frobenius equivariance: | (go) =o (f o) 
(2) the fundamental theorem of calculus: do f is the canonical inclusion 
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1 1 
Qi ® Q, i ig 


(3) and Fi od is the natural map Ais Aloc/ (constant functions). 


Given points P,Q € C(Q,) the definite integral [Po is then defined as 
(f o) (Q) - (f w) (P), which is a well-defined function of P, Q. 


Balakrishnan-Bradshaw-Kedlaya [1] describe how, using these properties, the 
computation of Coleman integrals of 1-forms on a curve can be broken up into two 
parts: 


(1) The computation of tiny integrals between points in the same residue disk, using 
local coordinates on the curve. 

(2) The evaluation of exact forms appearing when reducing Frobenius pullbacks of 
differentials in Monsky-Washnitzer cohomology. 


In the remainder of this section, we give a description of local coordinates that 
can be used to compute tiny integrals and describe how this method works in our 
setting in more detail, and in the next section we describe a procedure for evaluating 
the relevant exact forms. 


3.1 Local Coordinates on Superelliptic Curves 


In order to compute Coleman integrals locally on a p-adic disk, we need an explicit 
local parametrisation of such a disk. To do this we apply Newton’s method/Hensel’s 
lemma; to find a power series solution x(t) to a polynomial F(x(t)), we start with 
some xo(t), and iterate xj(t) = xj-1(t) — F(xj-1(0))/F'(Qi-1(t)). The limiting 
power series is the desired solution; applying this directly with F(x(t), y(t)) = 
y(t)* — h(x(f)) in the x or y variable leads to the next results, giving local 
coordinates around a finite branch point or non-branch point. 


Proposition 3.2 (Local Coordinates Around a Point not in a Branch Disk) Let 
P € C(Q,) be a point not in a branch disk, represented as P = (X,Y) on y* = 
h(x). Then local coordinates (x(t), y(t)) around P can be given by 


x(t) =X +t, 

yo(t) = ¥, 

yi(t) =~ (« ~ Dy) + a) and 
a yi-1(t)* 


yt) = Hs yi(t). 
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Proof We may simplify 


ner yi-10)* —h@@) a. h(x(t)) 
yi(t) = yi-1(t) ayia! = yj_1(t) Zii-1®) + aoe 
er hex(t)) 
a (« pet art) 


| 
Proposition 3.3 (Local Coordinates Around a Point in a Finite Branch Disk) 
Let P € C(Q,) be a point in a non-infinite branch disk, represented as P = (X,Y) 
on y* = h(x). Then local coordinates (x(t), y(t)) around P. can be given by 
yO=V +t, 
xo(t) = X, 
y(t)* — hA(Qi-1(0)) 


i(t) = xj-1(t , and 
Xi (t) = xj-1(t) + h'(x_10) an 
x(t) = lim x;(t). 
I> € 
Proof Direct application of Newton’s method. | 


For the infinite disk the choice of uniformiser is less obvious and several different 
choices can be made. For instance, one could take a local coordinate t at oo to be 
such that x = ¢~“. Then given a point (x9, yo) in the same residue disk as 00, to 
find the value of the parameter ¢ that gives (xo, yo), we can take each of the possible 
roots ~/x9 and check which of these give yo. 

Instead, we take ¢ to be an appropriate monomial in x, y. This has the advantage 
that, to find the value of the local coordinate for a point (Xo, Yo), we may compute 
the corresponding coordinate to using only multiplication of the values Xo, Yo. 
Precisely, we do the following: 


Proposition 3.4 (Local Coordinates Around 00) Let oo be the point at infinity 
on C. Then local coordinates (x(t), y(t)) around oo can be given as follows. Let 
£,k be such that bt — ak = 1. Set 


xo(t) = 4~°t74, 


yot) =a *t-?, 
WG) ae 
J(x,y)= ; 
(x,y) (Gee lige) 


x; (t) xi-1(t) x ( tyi1(t)* — xi1(0)* ) 
= — J(xj-1(0), yi- ; 
c) Ge FA OY Neha @) 


112 A. J. Best 
Then 
x(t) = lim x;(¢), 
I> oo 
y(t) = lim yj(t). 
100 
Proof Using the assumption that gcd(a, b) = 1, we may find @, k such that bé — 
ak = |. We now find x(t), y(t) such that tf = x(t)¥/y(t)®. To do this, we solve 


ty(t)® = x(t)‘ and y(t)" = h(x(t)) simultaneously using multivariate Newton’s 
method; this gives the recurrence stated. | 


When working with hyperelliptic curves, one can use the hyperelliptic involution 
to avoid computing integrals from infinity to points in branch disks, where the exact 
forms that would need to be evaluated fail to converge. We can also avoid this issue 
when working with superelliptic curves as follows: 


Proposition 3.5 Fix to be the automorphism of the cover 
c>Pp! 
given by 


“uw: CoC 
(x,y) > (X, bay) 


for some fixed primitive a-th root of unity fq € Q,. If P, Q are branch points and w 
is a differential for which p*a = cb w for some B € {1,...,a— 1}, then 


P 
i o=0. 
Q 


Proof We have 
Q uQ P Q 
/ o= | w= | wo=se f Oo. 
P uP Q P 


Corollary 3.6 Let P be a point in a branch disk. We have for any branch point Q 


00 Q 
i, o= | Qo. 
P P 
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Choosing Q to be the branch point in the same disk as P we can therefore 


compute 
[o,2) 
[ie 
P 


by expanding only in local coordinates around Q. 


3.2 The Action of q-Power Frobenius 


We now assume that we know the action of ¢ on Q Ae in the sense that we have the 
following: 


(1) A fixed basis (@;); of Hua), thought of as a column vector 
(2) a matrix of Frobenius MV, 
(3) and a vector of primitives (f;); such that, 


(¢* ai); = M(wi)i + fii € (2),5)°8. 


This data is what will be returned by (our generalisation of) Kedlaya’s algorithm. 
The next lemma calculates the action of ¢*” on Q at from this data; this is 
consistent with [2, Rmk. 1] but appears to be different from [1, Rmk. 12]. 


Lemma 3.7 The action of a power of Frobenius on the basis differentials is given by 


eo)i= > I] vap|e"GdAi+ [] # Wei. 


t=n—1.,..., 0 \s=n—-1,...,t+1 s=n—1,...,0 


Proof By induction on n, we will apply the relation 
$* (@i)i = (Afi)i + M(i);- 
The base case 
¢(w)i =0+1- (di 


holds trivially (as does the n = | case which is simply the fundamental relation 
above), and we have 


vrion =e ( y ( I] van) eras) oo I] vane) 
st+l1 Ss 


t=n—1,..., 0 \s=n—-1,...,t4 s=n—l,..., 0 


= - Se ( Il a Il 
t 0 \s Ss 


I] etl(m) grt 
t=n—-1,...,0 \s=n—-1,...,t+1 


gtmydf)it T] 
(0) Ss 


Mahi 


ot | (M)M(@,); 


= os ( I] an) o'tlaf); 
0 


s=n-—-l,...,t+1 


oMandfn+ T] uni 


os ( I] van) orianns I] 
0 \s=n,...,t+2 Ss 


t=n—-1,...,0 \s=n,..., s=n—l,...,0 
=» I] ¢ao)eapit+ T] &’aoe@ii. 
t=n,...,.0 \s=n,...,t+1 s=n,...,0 


ot Md fi)i + 
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o°* | (M)o* ((@;)i) 


TI] @&@0@i 


|_| 
We therefore have 
( Q ( og" P ¢"Q Q 
ve Oj _ i wi +/ Wj + Oj 
P i P oO ff; 
g"P Q Q 
=(f oi + f « +. De I] em (/ eas) 
P #2 7;  t=n-1,...,0 \s=n-1,...,t41 P. i 
: Q 
+ [J em | oj 
s=n—l,..., 0 P i 
ane 2 Y e t t 
= | w+ f a)+ > I] #an] (f@'Q) - f¢'P)); 
"OQ 7;  t=n—1,...,0 \s=n—l,...,t41 
Q 
+ [JT #MM ( / «] 
s=n—1,...,0 P i 
hence 
(3.3) 
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$"P Q 
(/ «) +(f o») - > ( Tl van) (FG! Q) — fle" P)),. 
3 i "OQ Si o\s vat tl 


t=n—1,..., 
(3.4) 


So computing ( i M4 wi), reduces to computing (3.4) and then inverting the matrix 
l 


i= == ae 0 p*(M). 

Thus in order to compute the Coleman integrals between points P, Q we need 

to know M and evaluations of ¢’ f; at the points P, Q fort = 0,...,n — 1. From 
(3.1), (3.2) we see that to compute each (¢' f;)(P) we need to compute f;(¢'’ P) and 
apply o°. 
Remark 3.8 This makes the Teichmiiller point variant of Coleman integration 
algorithms more appealing when working over extensions. In that approach, the 
Teichmiiller point is used in each non-branch disk as a base point and all integrals are 
computed in two parts: an integral to the Teichmiiller point and then tiny integrals for 
reaching the rest of the disk. The advantage is that all of the points ¢' P will be equal 
for a Teichmiiller point P, hence also the values f;(' P), so fewer evaluations of f; 
need to be carried out. We do not take this approach in the current implementation, 
however cf. Remark 6.1. 


4 Reductions in Cohomology 


We now describe the reduction process. The foundation for this is the work of Min- 
zlaff in [20], which gives explicit maps, reducing elements of Qh to cohomologous 
ones with smaller x- or y-degree. To compute Coleman primitives, we must, in 
addition, record the exact forms subtracted to obtain these cohomologous elements 
and determine a recurrence that computes the evaluation of the sum of these forms 
at the end of the reduction process. 

We consider the following spaces of differentials for s,t ¢€ Zand s > —1: 


Wor = (xiy J! -xSy “de :0<i <b-2,1<j<a-1}, 


and for s = —1 we take only the subspace with i > 1. 

These differentials are such that W_1,9 is spanned by the basis chosen in 
Theorem 1.1. 

We may decompose W, + into eigenspaces under the action of the superelliptic 
automorphism ,z to obtain 


Wor = We, B-- OWA" 
where 


Wi =ialy Pay Ode 37S 2). 
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As the subspaces indexed by £ are all preserved in everything that follows, we 
consider each 1 < B < a — 1 independently from now. 

We may choose a lift of Frobenius on C by letting 7: Q, —> Qa, be the Witt 
vector Frobenius and setting 


o(x) =x?P 
as shown in [20, Sec 4.] this forces us to take 
al k 
= \ (a(h) — hh?) 
— yp 
o(y)=y yD 2) ype 


Minzlaff also determined an approximation of the action of Frobenius on the basis 
differentials. Specifically we have 


Lemma 4.1 ((20, Prop. 4.1]) Jf we let ie be the coefficient of x" in h* and 
N-1 = i e 
Hirj = POO) D> eae 7 (;) oe 
t=k 


then the reduction in cohomology of the I-form 


peggy Pe de 


is congruent to that of o(x'dx/y/) modulo p. Moreover the exact forms that 
reduce these respective cohomology classes to the cohomologous linear combina- 
tion of our chosen basis elements are also congruent modulo p’. 


Proof We denote by D, the divisor C \ U. Then Minzlaff [20, Lem. 3.4] shows 
that for any w € (24/z, with pole divisor 


(@)oo < mDy, 
we have 
plse™—D 1 (e) € Qa/z,- 
However the proof of this statement given there shows that in fact if 


@=1(w)+dF 


then we also have 
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plleepon—D] pp ed: 


As in the proof of [20, Prop. 4.1], we apply this to the series 


oo i 
o(x'dx/y/) = )° r( (rou — hP yp D-ly- Pkt Day, 
k=0 


and writing S, for the kth summand of the infinite sum on the right, we need to show 
that if k > N and 


Sk = 1(Sx) + dF x, 
then z($;) = 0 (mod py) and F, = 0 (mod ae Fixing k > N, we can write 


(k+1)p-1 
Se =p! SS foxy dx, where my = —p(ak + j) +al 
£=0 


with fe € Zg[X]deg<p-1- 
So we apply the above integrality statement to each 


fe(x)y™ dx. 


The same power of p used by Minzlaff is therefore sufficient. a 


The reduction proceeds in two stages, first horizontal reduction that reduces the 
index s and hence the power of x appearing in the 1-forms and leaves us with forms 
in W_,,; to consider. Then vertical reduction is performed that reduces the power of 
y appearing and reduces all forms to W_1 0. 

We present the vertical reduction step first however, as although it takes place 
after the horizontal steps when the algorithm is executed, knowing the form of 
the vertical reduction ahead of time allows us to make some simplifications in the 
description of the horizontal reduction. 


4.1 Vertical Reduction 


Because h is squarefree and i is a unit, we can find for eachi = 0,..., b—2, a pair 
of polynomials R;, S; € Z,[x] with deg R; < b — 2 and deg S; < b — 1, such that 


xi = Rh + Sih’. 


We fix a choice of such polynomials now. 
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The vertical reduction proceeds by reducing a 1-form in wé 1, to a cohomolo- 


gous one in w? 1.¢_1 USing the following lemma: 


Lemma 4.2 (Vertical Reduction) We have 


—a 


i —at—Bq ae d(S; —at—B+a 
xy x aa ) 
at + B — a)Rj(x) + aSi (x 
— ( B ) i ( ) "( ) y-a0-D-B gy c Wis 
at+B-a 


Proof Via our choice of R;, 5; we have that 
xby UP dx = (Ri (x)a(x) + Si(x)h' (x)) y~" Pd, 
and also that 


dS, ey OF) = Sia) yO de & (ar = 6 Fa) S@)y Ody 


1 
= Si(xyy PH dx + (—at — B+ a)-S; (x)y~ Fh! (x) dx 


so 
—a —a 
d S; —at—Bta = Ss! —at—ptag S. —at—B p/ d : 
ae a (Si(x)y ) Pray eee i(X)Y x+Si(x)y (x)dx 
and the above equality holds. | 


This reduction is a Q,-linear map 
B B 
Weig Weg 


To express it in matrix form, we denote the coefficients of R; by r;,; and likewise 
Si for those of S;, each R; and S’ is of degree at most b — 2. 

To compute evaluations of the primitives for Coleman integration, we augment 
these linear maps to also include a vector of length L, that holds the data of 
evaluations of the exact form subtracted so far at several points P; <€ C(Q,), 
i = 1,...,Z. We assume that all of these points do not lie in a branch disk 
(including the infinite disk). This implies that x(P;) and y(P;) are both integral 
for all i and that y(P;) is a unit in Z,. So we have linear maps 


L L 
Weis x Q, rae we, x Q,. 


Doing this as described gives a matrix whose entries are non-linear functions of the 
index t, due to the presence of the term y~“’+¢~8 in the exact form 
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—a 


S. —at—p+a 
ai +B —a i(x)y 


To remedy this, we use the approach of [7] and modify the reduction process to 
obtain matrices with entries linear functions of t, by factoring out the powers of y 
in the exact form. As we reduce monomials from we i eae wé Le We can let each 
later reduction step multiply the exact form by x. This results in a total power of x* 
when the reduction finishes in W? 


“14> Which is the same power of x as in the exact 
form obtained from W? 


B 
sit > Wey 


Let De be the scalar at + 6 —a € Z,[t] and define a (b—1+L) x (b—1+L) 
matrix 


RY) = MEODE (OT, 
where M : (t) is the matrix 


bt (t)ro,o + aS.9 on Dt (t)rp—-2,0 + 45,5 9 


DY (t)ro,p—2 + ASO p_2 tee bt (t)rp—2,p—2 + aS), 9 p_2 
—aSo(x(P1))y(Pi)~P*4 «+ —aSp—2(x(P1))y(Pr)78** y(P1)~4 DE (1) 0 


—aSo(x(Pr))y(PLyP44 ++ aSp_2(a(PL))y(PryP 0s y(Pr)* DED) 
(4.1) 


4.2 Horizontal Reduction 


We now describe the horizontal reduction process. 
The horizontal reduction proceeds by reducing a 1-form in we 41, t0. a cohomol- 


ogous one in we ,; using the following lemma: 


Lemma 4.3 (Horizontal Reduction) We have 


b+s—1 —at—B q d a 5.—at+a—B 
_ “ (oa m 


a(sh — 1(at —a + B)xh' 
= ( a B) ) ps1 y—at-B gy c We. 
ash — (at —a+ B)AB 


Proof We directly compute 


diy FF) ayah ly Pd ca (at —a+t B)xsy—ata-B-lay 
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1 
= (se ——-(at+B- ax'(s)) y Bd 
a 
> 1 s 
= (sta! +h) — —(at + B — a)x* (Abx?-4 + i’) y Bd 
a 
S 1 of 
= (sos! +h) — —(at + B—a)x(Abx?4 + i’) arg aE de: 
a 


Therefore, by subtracting d(xSy—@'+4-P) from x?x5 y—4-Bdx, the 


a 
ash—(at—a+B)bir 
remaining terms are all as stated, and of lower degree so that they lie in we re | 


As above we augment our linear maps 


we 


B 
sti, > Ws, 


to also include a vector of length L that holds the data of evaluations of the exact 


form subtracted so far at P; € C(Q,),i = 1,..., L not in the branch disks: 
L L 
Weis x Qo > we, x Q7. 


Doing this gives a matrix whose entries are non-linear functions of the index, due to 
the presence of the term x* in the exact form 


—a 


A((at —a+ B)b—as) 


hy GRE ; 


Instead we multiply the reduction evaluation computed so far by x(P;) at each 
reduction step which results in a total power of x* when the reduction finishes in 


w? ir: This is precisely the power of x in the exact form obtained from we oy; 


we ,. We also do not multiply by y~@'+¢-8, as this is the same power of y that is 
multiplied by in the vertical reduction steps that take place after the horizontal ones. 

Rephrasing this in terms of a matrix, the reduction process is given by multiply- 
ing by a matrix of the form 


> 


—1 
RYP(s) = MEP (s) DEP (8), 


where De (s) = A((at —a+B)b—as), and pe is the linear function of s obtained 


as the coefficient of x! in ash(x) —(at-—a+ B)xh' (x) and where MiP (s) is the 
matrix 
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1B 
tp 5 te 

Dy (s) ++ 0 Py 
d sD Ga, (4.2) 
0 = 0 a |x(P) DEF G)--- 0 
0 ++ 0 =a O +++ x(PL)DyP(s) 


This is the linear recurrence to which we may now apply the modified Bostan- 
Gaudry-Schost algorithm [7, Thm. 5.2], as compute the result more efficiently than 
the step-by-step approach alluded to above. 


Remark 4.4 This is a slight modification of the approach used in [7, Sec. 4]. Here 
we make use of the commonality between the factors appearing in the vertical 
and horizontal stages to give a cleaner recurrence without the “correction factors” 
appearing in [7, Thm. 4.4]. 


Remark 4.5 (Regularity) For some applications (such as Chabauty-Coleman) one 
is only interested in invariant differentials on the Jacobian, or, equivalently, regular 
1-forms on the curve, as these are the forms whose integrals define abelian integrals 
and provide logarithm-type maps to Q,. In [21, Prop. 2] it is shown that of the 2¢ 
basis differentials x'dx/y/, as above, the g regular 1-forms are those for which 


la alala) | 


It is interesting to note that the algorithm developed here computes the integral of 
all 2g basis differentials simultaneously and there seems to be no way of computing 
integrals of only a subset of them via this approach. 


5 Runtime and Precision 


We now analyse the runtime of the above algorithm, proving Theorem 1.1. We only 
analyse the steps which differ from that occurring in [20, Sec. 6]. All preparatory 
steps remain the same. It is only the main reduction steps where we now have larger 
matrices to compute the evaluations of exact forms. 

We may apply [7, Thm. 5.2] as the reduction matrices above are (m+n) x (m+n) 
matrices with an m x n block which is zero and ann x n block which is diagonal. 
This uses 


O ((mMim) + MM(m, n))VK + (m? es mn) M(V/K)) 
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ring operations. We apply this for each row of horizontal reductions (with K = 
O(pN) in the worst case) and for the vertical reductions also (where K = O(pN) 
again). 

The precision loss and gain throughout the algorithm is as in [7, Sec. 7] and [20, 
Sec. 5.3]. We can therefore compute (3.4) for Q = oo and L distinct Teichmiiller 
points P via the above reduction procedure to precision N in time 


O ((g” + Lg®-!),/pnN>? + N*93(g + L)n” log ) 


where O(t) := O(tf(ogt)) for some polynomial f. The precision loss from 
inverting | — [] 9 ? (M) is precisely 


s=n—l,..., 
vp | det | 1 — Il ¢*(M) | | = #Jac Cr», (Fp). 
s=n—1,..., 0 


Applying the above to L = 2 gives Theorem 1.1. 


6 Implementation 


We have implemented the algorithm outlined above in Julia using the functionality 
provided by the Nemo package [12] and its extension Hecke. This implementation 
is available online at https://github.com/alexjbest/Coleman.jl. 

In this section we describe the implementation and discuss some surrounding 
issues. To check correctness of the algorithm and implementation, we begin with 
some examples: 


6.1 Examples 
6.1.1 A Picard Curve 
The curve 
Cs yp? = h(x) = x44 7x3 + 3x7 — x 


was suggested to me by Hashimoto-Morrison [17], it has an algebraic 9-torsion 
point 


P = (1, Y10), 
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so we take p = 41, which splits inside the ring of integers of Q(</10) as a product 
of a prime of norm 41 and one of norm 417, using the latter embedding we can take 
P € C(Qy,2). Explicitly if Qy,2 = Qui[a]/(a? + 38a + 6) we have 


P=(1+ O(415), (14430- 41! 4 19-412 +24-41°+35-41>)-a@ 


+11+20-41! + 33-41? + 23-413 + 32-414 + 34-415 + 0(41%)). 


As the x-coordinate of this point is 1, it is actually a Teichmiiller point (fixed 
under @), thus we do not have to do any tiny integrals within a disk to compute 
the Coleman integrals 
CO 
[ie 
P 


for all @ in the basis. Using our package and running the command 
julia> ColemanIntegrals(3, h, 3, 41, 2, P, :inf) 


(where respectively the arguments are, degree of the cover, x-polynomial, precision 
requested, base prime, extension degree, and endpoints) this returns: 


0 
(3+21-41! + 32-41? + O(419))-a4+ (15 + 37-41! + 37-41? + 0(413)) 
(32+ 18-41! +25. 412+ O(413))-a+ 374+ 10-41'+9.-41? + 0(413)) 
0 
0 
(27+ 10-41! + 21-417 + 0(413))-a+ G0+20-41!'+7-41? + 0(413)) 


This exactly reflects the fact that [ P — oo] is torsion in the Jacobian and that only 
the invariant differentials provide group homomorphisms from the Jacobian to Q,)2 
(forcing the images of torsion points to be trivial). Note that the basis of differentials 
is ordered by j and then i here and that the regular differentials are as described in 
Remark 4.5. 

We can also check Galois equivariance for this example, by computing that 


((-)-((f#)), 


as both evaluate to 
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0 
(38+19-41! 48.4124 O(413))-a+ (244 15-41! + 32-412 + O(413)) 
(9+ 22-41! 415-412 + O(413))-a + (10+ 35-41! + 25-412 + 0(413)) 
0 
0 
(144+ 30-41! + 19.412 + 0(413))-a+ (294+ 25-41! + 19-412 + 0(413)) 


6.2 An Elliptic Curve over a Quartic Field 


The elliptic curve with LMFDB label 4.4.725.1-16.1-al is defined over the quartic 
number field K = Q(a) where a is a root of x* — x3 — 3x +x + 1. This curve can 
be given by the model 
y* = x3 + (2a? + 6a? — 9a — 4)x* + (32a? — 8)x + (16a? + 48a? — 16 — 16). 
On this model there is a K -rational 17-torsion point with coordinates 

P = (0, 4a’). 
We work with p = 43, which remains inert in K, and fix an embedding 


K > Qy34. 


We can then compute the pair of Coleman integrals 


[> [ dx 
=S ee 
P yy JP y 


as before, obtaining the values 0 and 
(144-3143! 4 29.497 439.43? 4 0(43*)} -.c7 
4: (41 4-5 +43! 4 15-43" + 12.437 4. 014374)? 
+(11+17-43! + 30-437 4+ 11-433 + 0(434)) -a 


+ (26 + 12-43! + 28 . 437 + 28 - 433 + 0(434)), 


respectively. 
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6.3 Implementation Details 


Minzlaff has made available an open-source implementation of the algorithm in 
[20]. This includes an implementation of the algorithm of Bostan-Gaudry-Schost 
and Harvey in the Magma [10] programming language, and computes zeta functions 
of superelliptic curves over F, in ,/q time. This implementation is included in 
recent versions of Magma (as ZetaFunction) and is separately available online 
at https://github.com/mminzlaff/superelliptic. Our implementation is built on top of 
a direct translation of Minzlaff’s code into Nemo/Julia. 


Remark 6.1 There are two (related) ways to set up a Coleman integration algo- 
rithm: integrating to Teichmiiller points, or adding the tiny integrals to Frobenius. 
When using the former, checking that additivity in endpoints holds does not test 
the implementation for bugs in any serious way as when broken down, the “paths” 
integrated along form a tree. However with the latter approach, checking additivity 


in endpoints of the form 
lo) P lee) 
i @+ i. o= / (a) 
P / / 


for P, P’ in the same residue disk, checks that the local and global integrals are 
consistent, confirming that there are “non-degenerate triangles” here. 


6.3.1 A Comparison of Computer Algebra Systems 


In this section, we take the opportunity to compare and contrast programming 
languages available for working with computational number theory and arithmetic 
geometry. We discuss some similarities and differences between these computer 
algebra systems and advantages and disadvantages in the author’s experience 
between the way they behave. 

Magma and Julia are similar in several syntactic aspects which made transla- 
tion from the former to the latter easier. Both operate in a functional way, i.e. 
factor (M) rather than M. factor (). This syntax is more similar to the syntax 
used in mathematics. However it presents difficulties for introspection, that is, it 
is not always easy to find a function performing a particular task, given only a 
description of that task or its common name. In the Python-based SageMath, given 
an object M, the user can interactively type M. and press <TAB> to see a list of all 
methods associated to the object. It is a common paradigm for a user to search this 
list to find functions. In languages such as Julia and Magma where the dispatch 
system makes such functionality difficult, it is important that good searchable 
documentation exists to help users find the functions they need. For example, a 
function with a common name like Normalise when called as Normalise; 
will show a list of several possibilities of types of objects a Normalise function 
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exists for. The user must then search this list to find the description and signature 
for the one they wish to call. The Sage user would instead take their object X and 
type X.norm<TAB> to see if a normalise function exists. 

Nemo is a new package in comparison to SageMath and Magma, and so some 
common general-purpose functionality that would be expected from a general 
computer algebra system is yet to be implemented. At present this can slow down 
development of mathematical code using Nemo. However Nemo is built on top of 
the C libraries Flint, Arb, and Antic, which have been under development for far 
longer and contain significant functionality. Missing features often simply need to 
be wrapped from these libraries. Wrapping involves adding only 3-5 lines of code 
on average as Nemo is tightly coupled to these underlying libraries. It appears that 
interfacing with external C libraries is simpler when using Julia than in SageMath 
and Magma. 

Magma and SageMath are a lot more stable at present due to the maturity of these 
systems. This makes continuous integration and testing of code based on Nemo 
more vital, to prevent code from diverging when the Nemo core is modified in an 
incompatible way. 

The open-source development model and Julia’s built-in package manager also 
mitigates the immaturity problem somewhat, as it is easy to create a public fork of 
Nemo or to create a package based on it. For instance, at the time of writing there are 
at least 30 distinct Julia-based repositories on GitHub that use Nemo in some way, 
and many hundreds of independent Julia packages, some of which also containing 
functionality relevant to arithmetic geometers. This iterative and modular way of 
sharing research code is ingrained in the design of Julia and has also been proposed 
as a more lightweight development model for additional packages on top of the 
SageMath core https://wiki.sagemath.org/CodeSharingWorkflow. The advantages 
over more centralised development are a lower barrier for research code to be made 
available. It is trivial for anyone to host code in a version-controlled repository and 
create the requisite Julia package files, so that the code may then be installed using 
the built-in package system. With a more modular system, the notions of ownership 
and responsibility for individual packages are more clear, users can report bugs 
and feature requests directly to the developers of smaller modules, rather than all 
traffic going to a centralised bug tracker, or worse, a closed email list with no public 
tracking of bugs at all. 

For a working mathematician, writing high-level code that is as close as possible 
to mathematical language is preferable, for speed of development, sharing of 
research code and identification of bugs. In general, the overhead of using a high- 
level language can render many potentially practical algorithms too slow for serious 
use on large scale. For instance, algorithms such as the accumulating remainder 
tree of [11, 15] can lose their practicality when a large call overhead (such as that 
in Python) is introduced at each recursive step. This necessitates the use of low- 
level languages such as C to implement key core functionality, like fundamental 
arithmetic operations in different rings, operations with polynomials, and linear 
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algebra. When converting from high-level code to low-level, in general more 
code must be added, to explicitly construct and destruct objects, manage memory, 
iterate over lists, and handle files. This can often obscure the mathematical process 
underlying a given algorithm and obstruct understanding and later generalisation. 

In SageMath, Cython is used to mitigate such overhead problems somewhat, and 
to wrap low-level C libraries. Its Python-like syntax makes conversion easier and 
makes comprehension of code easier, and it does reduce the overhead of Python/ 
Sage code. 

Using domain-specific Julia packages to write high-level code, which is compiled 
at runtime to a fast lower-level implementation can provide a useful balance for a 
mathematical user. While there are applications where low-level control is essential: 
for instance in [9, Sec. 3.2] working at the machine level is necessary to obtain 
results on such a large scale in as small amount of time (and also therefore cost) as 
possible. Using a high-level language that is compiled (at runtime or before) into a 
lower-level language can strike a good balance between speed of the computation 
and time taken to write the code. 

The distinction between compiling at runtime vs. before the software is run is 
most apparent when experimenting with new code. Anything that can be done at 
runtime can of course be done before, but the user has more flexibility when they do 
not have to exit the CAS and recompile to take advantage of compiled code. 


6.4 Timings 


We provide here some timings of both the Magma implementation of Minzlaff and 
Julia/Nemo implementation of the underlying zeta function algorithm, which is the 
only part common to both implementations, and this uses the same algorithm due to 
Minzlaff. Hence the comparison below is really a test of how the underlying systems 
handle the operations used by this algorithm (of course further optimizations to both 
implementations may well be possible). We do not time the Coleman integration 
code here as the linear recurrence method that underlies the zeta function algorithm 
is the main component of the runtime for computing Coleman integrals also. 
Tables | and 2 contrast the times taken to compute zeta functions of superelliptic 
curves of several different types over several medium cardinality fields F,, between 
the Magma and Julia/Nemo implementations of the algorithms discussed in this 
paper. 

The dash indicates a parameter range where neither implementation applies due 
to the standing assumption on p in (2.1). 
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Table 1 Timings for the (a, b) 
Magma implementation q 25) |27) |G) (4,7) 
257 0.180 | 0.647 | 4.357 | 10.140 
521 0.253 | 0.963 | 4.373 | 10.260 
1031 0.290 | 1.017 | 6.860 | 16.543 
2053 0.413 | 1.673 | 7.080 | 16.680 
4099 0.487 | 0.960 | 7.473 | 21.530 
8209 0.750 | 1.440 | 8.407 | 21.767 
2577 0.700 | 2.743 | 21.643 | - 
521° 1.167 | 4.060 | 30.500 | 78.850 
Table 2 Timings for the (a, b) 
Nemo implementation q 25) (27) |G.) (4,7) 
257 0.197 |0.651 | 4.695 | 15.267 
521 0.366 | 1.27 4.779 | 17.892 
1031 0.415 | 1.348 | 13.287 | 43.691 
2053 0.973 | 3.727 | 14.135 | 44.516 
4099 1.133 | 2.332 | 30.306 | 114.239 
8209 3.261 | 6.486 | 36.414 | 114.61 
257° 0.339 | 1.487 | 23.569 | - 
521° 0.803 | 2.992 | 44.461 | 111.966 
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1 Introduction 


1.1 Motivation 


Databases of classical modular forms have been used for a variety of mathematical 
purposes and have almost a 50 year history (see Sect. 2). In this article, we report 
on a recent effort in this direction in the L-functions and modular forms database 
(LMFDB [60], https://Imfdb.org); for more on the LMFDB, see the overview by 
Cremona [32]. 


1.2. Organization 


The paper is organized as follows. In Sect.2, we begin with a short history, and 
we follow this in Sect.3 with a preliminary discussion of Dirichlet characters. 
Next, in Sect.4 we make more explicit what we mean by computing (spaces of) 
modular forms, and then in Sect.5 we give a short overview of the many existing 
algorithmic approaches to computing modular forms. We pause in Sect. 6 to prove 
two technical results. In Sect. 7, we sample the available implementations and make 
some comparisons. Next, in Sect.8 we discuss some computational, theoretical, 
and practical issues that arose in our efforts and in Sect.9 we explain how we 
(rigorously) computed the L-functions attached to modular newforms. Turning 
to our main effort, in Sect. 10 we provide an overview of the computations we 
performed, make some remarks on the data obtained, and explain some of the 
features of our database. Finally, in Sects.11 and 12 we treat twists and issues 
specific to modular forms of weight 1. 

As is clear from this organization, we consider the algorithmic problem of 
computing modular forms from a variety of perspectives, so this paper need not 
be read linearly. For the convenience of readers, we draw attention here to a number 
of highlights: 


¢ In Sect.2, we survey the rather interesting history of computing databases of 
modular forms. 

¢ In Sect. 3.2, we exhibit a labeling scheme for Dirichlet characters, due to Conrey. 

¢ In Theorem 4.3.4, we record formulas for the new, old, and total dimensions of 
spaces of Eisenstein series of arbitrary integer weight k > 2, level, and character, 
obtained from work of Cohen—Oesterlé and Buzzard. (Such formulas are not 
available for weight k = 1.) 

¢ In Corollary 6.1.5, we compute an Eichler—Selberg trace formula restricted to the 
space of newforms; this was used by Belabas—Cohen [4] in their implementation 
in Pari/GP. 

¢ In Tables 5 and 6, we compare the implementations of Magma and Pari/GP; in 
Table 7 we note some computationally challenging newspaces. 


Computing Classical Modular Forms 133 


¢ In Sect. 8.7, we show that by writing Hecke eigenvalues in terms of an LLL- 
reduced basis of the Hecke order, we can drastically reduce their total size. 

e In Sect.9.4, we certify analytic ranks of L-functions of modular forms and 
remark on the ranks occurring in our dataset. 

¢ In Sect.9.5, we numerically verify a generalization of Chowla’s conjecture for 
central values of non-self-dual modular form L-functions. 

¢ In Sect. 10.2, we present statistics on our data, and in Sect. 10.4 we note some 
interesting and extreme behavior that we observed in our dataset. 

¢ In Theorems 11.2.4 and 11.2.8, we exhibit simple and effectively computable 
criteria for rigorously certifying that a modular form has an inner twist. 

¢ In Sect. 12.5, we highlight some interesting and extreme behavior found among 
weight | modular forms in our database. 
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2 History 


In this section, we survey the history of computing tables of modular forms; for a 
broader but still computationally-oriented history, see Kilford [54, Section 7.1]. 


e Perhaps the first systematic tabulation of modular forms was performed by 
Wada [97, 98]. As early as 1971, he used the Eichler—Selberg trace formula to 
compute a factorization of the characteristic polynomial of the Hecke operator 
Tp on S2(To(q), x) for g = 1 (mod 4) prime where x was either trivial or the 
quadratic character of conductor g. The total computation time was reported to 
be about 300 hours on a TOSBAC-3000. 

e The next major step was made in the famous Antwerp IV tables [71] (published 
in 1975), motivated by the study of modularity of elliptic curves. Vélu and 
Stephens—Vélu computed all newforms in S2([o(V)) with N < 200 using 
modular symbols [71, Table 3] and these forms were matched with isogeny 
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classes of elliptic curves over Q found by Swinnerton-Dyer. Tingley [95] 
computed the complete splitting into Hecke eigenspaces of S2(To(N)) for 
N < 300, extending an earlier table due to Atkin. In particular he found the 
dimensions of the Atkin—Lehner eigenspaces, and computed the actual eigen- 
values as floating point numbers, numerically matching conjugate newforms. By 
integrating differentials, he also computed elliptic curves from the newforms with 
integer eigenvalues. In some cases, this computation revealed the existence of 
elliptic curves not previously found by search. (According to Birch, this was the 
case for the elliptic curve with Antwerp label 78A and Cremona label 78a1; the 
curves in its isogeny class have rather large coefficients.) 

e Extending the Antwerp IV tables, Cremona [30] (first edition published in 1992) 
computed a database of newforms in $2([o(N)) with rational coefficients for 
N < 1000, providing also a wealth of data on the corresponding (modular) 
elliptic curves. In the second edition and in later computations, this data was 
considerably extended. A more recent report [31] was made on the elliptic 
curve tables to conductor 130,000. By 2016 this database had reached conductor 
400,000, and in July 2019 Cremona and Sutherland extended it to conductor 
500,000. In this range there are 2,164,260 rational newforms, and the same 
number of isogeny classes of elliptic curves. 

¢ Miyake [67] published some numerical tables of modular forms as appendices in 
his book on modular forms; these were computed using the trace formula. These 
tables included dimensions of S;(I‘9(NV)) for k > 2 even and small values of 
N, eigenvalues and characteristic polynomials of Hecke operators on $2(I9(V)) 
for small prime values of N, and Fourier coefficients of a primitive form in 
So(To(N), xn) for N = 29, 37. 

¢ In the 1990s, Cohen, Skoruppa, and Zagier compiled tables of eigenforms in 
weights 2 through 12, levels up to 1000 in weight 2 and with a smaller range in 
higher weight; also some tables of eigenforms with non-trivial character. Their 
method followed a paper by Skoruppa and Zagier on the trace formula [87], but 
these tables were not published. 

e In the early 2000s, Stein created an online modular forms database [88], 
computed primarily using a modular symbols package [89] he implemented in 
Magma [11] starting in the late 1990s. The data was computed using a rack of six 
custom-built machines and a Sun V480; it was stored in a PostgreSQL database 
(more than 10 GB), and a (Python-based) web interface to the data was provided. 
These tables included dimensions, characteristic polynomials, and g-expansions 
in a variety of weights and levels. 

¢ Using this Magma implementation, Meyer [66] computed a table of newforms for 
I'9(N) with rational coefficients: in weight k = 2 he went to N < 3000 and for 
k=4toN < 2000. 

e Prior to our work, the LMFDB had a database of classical modular forms 
computed by Ehlen and Strémberg [42], which used the SageMath [79] imple- 
mentation of modular symbols. This dataset included partial information on 
Sx(Vo(NV)) for (k, N) in the ranges [2, 12] x [1, 100] and [2, 40] x [1, 25], and 
on Sx (1°; (4V)) in the ranges [2, 10] x [1, 50] and [2, 20] x [1, 16]. 
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The scope of our modular forms database includes all of the ranges mentioned 
above (and more), with the exception of Cremona’s tables of elliptic curves; see 
Sect. 10.1 for details. 


3 Characters 


Our database of modular forms is organized into subspaces identified by a level 
N ¢€ Zs, a weight k € Zs, and a character x: Z — C taking values in the 
cyclotomic field Q(¢y). In order to identify these subspaces and the modular forms 
they contain, we adopt a standard convention for identifying Dirichlet characters 
that is well suited to computation, the Conrey labels recalled in Sect. 3.2 below. We 
also introduce a convention for identifying Galois orbits of Dirichlet characters that 
will be used to identify the newform subspaces and newform orbits defined in Sect. 4. 


3.1 Definitions 


For N € Zs, a Dirichlet character of modulus WN is a pair (x, NV), where x: Z > C 
is a periodic function modulo WN that is the extension of a group homomorphism 
(Z/NZ)* —> C* by zero (defining x(n) = O whenever ged(n, N) 4 1)—in 
particular, x is totally multiplicative. The degree of a Dirichlet character x is the 
degree of the cyclotomic subfield Q(x) C C generated by the values of x. 

Given two Dirichlet characters x, x’ of moduli N, N’, we define their product 
xx’ to be the Dirichlet character of modulus Icm(N, N’) defined by (xx')(n) = 
x (n)x'(n). Under this definition, the set of Dirichlet characters of a fixed modulus 
N has the structure of a finite abelian group, with identity the principal (or trivial) 
character with y(n) = 1 if gcd(n, N) = 1 and x(n) = O otherwise. The order 
ord(x) of a Dirichlet character x is its order in this group, i.e., the smallest m € Zs 
such that x” is the principal character. 

Let x be a Dirichlet character of modulus N. Given a multiple N’ of N, we 
may induce x to a Dirichlet character x’ of modulus N’ by x’(n) := x(n mod N) 
whenever gced(n, N’) = 1 and x'(n) = 0 otherwise. Consequently, there is a well- 
defined minimal modulus M := cond(x) | N, called the conductor of x, such that 
x is induced from a Dirichlet character of modulus M. If cond(x) = N, 1.e., the 
conductor of x is equal to its modulus, then we say that x is a primitive character. 

It is sometimes convenient to think about Dirichlet characters without a modulus, 
remembering only a periodic, totally multiplicative arithmetic function x. In our 
context, Dirichlet characters arise from modular forms with level structure, so there 
should be little chance for confusion. 
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3.2. Conrey Labels 


We briefly describe a scheme, due to Brian Conrey, for labeling and computing 
with Dirichlet characters. Our labeling scheme can be thought of as a choice 
of an explicit isomorphism between two finite abelian groups: the multiplicative 
group (Z/NZ)” and the group of Dirichlet characters modulo N. In particular, our 
Dirichlet characters by definition take values in the complex numbers, so implicit in 
our choice of labels is a choice of embedding Q* <> C. 

For each N € Zs, we will construct a function 


xn: (Z/NZ)* x (Z/NZ)* > C* (3.2.1) 


satisfying the following three properties: 


* xn is multiplicative in each variable (separately); 

* xy is symmetric (i.e., xy (m,n) = xn(n, m) for all m,n € (Z/NZ)*); and 

* xy is nondegenerate (i.e., if xy(m,n) = 1 for allm € (Z/NZ)”*, thenn = 1 
(mod N)). 


Moreover, xy will be multiplicative in N, and hence it is sufficient to define it for 
prime powers p* and then extend xy(m, n) to general N by multiplicativity: 


xw(m,n) = | [ xpe(m,n). 


PON 


We use the notation p°||N to mean that p® | N but p°*! + N. On the left side, 
m and n denote elements of (Z/NZ)*, while on the right they denote the images 
of these in (Z/p°Z)*. We then extend xy to a multiplicative, periodic function on 
Z x Zby setting xyy(m,n) = 0 whenever gcd(mn, N) > 1. 

Under these conditions, fixing one input to xy defines a Dirichlet character 
modulo N and conversely every Dirichlet character arises in this way. Thus each 
Dirichlet character is given a unique name of the form x, (m, -) form € (Z/NZ)”. 
In particular, by symmetry, we see that yy (1, -) is the trivial character modulo N, 
and xy (m, -) is a quadratic character when m 4 1 (mod N) but m2 = 1 (mod N). 
(More generally, the order of the character xy (m, -) is the multiplicative order of m 
modulo N.) 

We now describe the construction of xy. 


Odd Prime Powers Let p be an odd prime. Let g be the smallest positive integer 
that is a primitive root mod p® for all e > 1. (This is almost always the same as 
the smallest primitive root mod p, but may not be; the only odd prime less than a 
million for which these differ is 40487.) For m € (Z/p°Z)*, we define log,(m) € 
Z/¢(p°)Z by the condition 


m = g's (mod p*), (3.2.2) 
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so that log, : (Z/p°Z)* — Z/$(p*)Z is an isomorphism of groups. 
For m,n € (Z/p°Z)*, we then define 


log, (m) log, (n) 
Xpe(m, n) := exp (201 ee) (3.2.3) 
p(P*) 
Then xX pe clearly satisfies the three required conditions (multiplicative, symmetric, 
and nondegenerate). 


Powers of 2 We define x2 to be the trivial map (so x2(1, 1) = 1), and define 
xa(m, n) = (DO VEDE (3.2.4) 


for m,n € (Z/4Z)*. Let e > 3. The group (Z/2°Z)” is generated by 5 and —1. For 
m & (Z/2°Z)*, we define €(m) € {0, 1} and logs(m) € Z/2°~°Z by 


m = (-1)65'°85) (mod 2°) (3.2.5) 


so that now (e, log;): (Z/2°Z)* — Z/2Z x Z/2°~*Z is an isomorphism. For 
m,n € (Z/2°Z)*, we then define 


(3.2.6) 


X2e(m, n) = exp (2 ie 2 


Je—2 


As for the case of odd prime power modulus, this function satisfies the required 
properties. 


In this article, as in the LMFDB, the Conrey label of the character xy (m, -) 
has the form N.m. For example, the Conrey label of x7(6, -), the unique quadratic 
character of modulus 7, is 7.6. 


3.3 Orbit Labels 


There is an action of the absolute Galois group Galg := Gal(Q* | Q) of Q on the 
set of Dirichlet characters of modulus N, defined by 


(ox )(n) = a(x (n)) (3.3.1) 


foro € Galg andn € Z. 

It is natural to organize characters by Galois orbits, and indeed we will also 
want to work with modular forms defined without an embedding into the complex 
numbers, specified up to the action of Galois (see Sect. 4.2). So we also assign an 
orbit label to each Galois orbit of Dirichlet characters, as follows. To choose this 
label we lexicographically order the sequences 
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ord(y), Tr x (1), Tr x (2), Tr x (3), Tr x (4), .-. 


of integers, where Tr: Q(x) — Q is the absolute trace; we then assign the label 
written in base 26 using the letters of the alphabet, so 


a,b,...,2Z,ba,bb,...,bz,ca,...,2z,baa,.... 


For every modulus N > 1, the Dirichlet character orbit N.a is the trivial 
character, since it is the unique character with (smallest) order 1. 


Example 3.3.2 The table below lists the Conrey labels of the eight Dirichlet 
characters of modulus 20, their values on the generators 11 and 17 of (Z/20Z)*, 
their orders, the absolute traces of their values the first five positive integers coprime 
to 20 (note Tr(x (n)) = 0 if gcd(20, n) ¥ 1), and the labels of the six Galois orbits 
in which they lie. 


Conrey Orbit 
label x (11) | x(17) | ord(x) | Tr(x (1) | Tex (3), TH (7)) | Tr (1) | Tr(x (13) | label 
20.1 |1 1 1 1 1 1 1 1 20.a 
20.11 | —1 1 2 1 =] =I 1 -1 20.b 
20.9 | 1 -1 2 1 -1 =i 1 1 20.8 
20.19] —1 —-1 2 1 1 1 1 =1 20.d 
20.3 |—1 —i 4 2 0 0 =2 =); 20.e 
20.7 |—1 i 4 2 0 0 —2 —2 20.e 
20.13} 1 -i 4 2 0 0 =2 2 20..£ 
20.17} 1 i 4 2 0 0 =2 2 2068 


Remark 3.3.3 The field Q(x) is contained in the coefficient field Q(f) of a 
newform f with character x. When the dimension of Q(/f) is large it may be 
difficult to compute a complex embedding Q(f) — C, and we often need to 
distinguish embeddings that are compatible with the Hecke action, which means 
we must know the image of Q(x) under embeddings of Q(f). Matching up roots 
of unity of large order can be surprisingly nontrivial! So when computing the 
coefficient field (as an abstract field, not necessarily embedded in the complex 
numbers), we compute the values of x on generators for (Z/NZ)* as elements 
of the coefficient field. In this way, we may organize embeddings of the coefficient 
field according to a desired embedding of Q(x). 

We could instead keep track of the coefficient field as an extension of Q(x), but 
that approach creates headaches when comparing results across implementations, it 
shifts the problem to a different place when working with forms in a Galois orbit, 
and it does not allow us to represent eigenvalues in terms of a nice LLL-reduced 
basis (see Sect. 8.7). 
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4 Computing Modular Forms 


In this section, we make precise what it means to compute modular forms. For 
background, we refer to the wealth of references available, for example Cohen— 
Stromberg [25], Diamond—Shurman [38], Serre [82, Chapter VII], and Stein [89]. 


4.1 Setup 


The group SL2(R) acts (on the left) by linear fractional transformations on the 
upper half-plane H := {z € C: Imz > O}. For N € Zs, define the congruence 
subgroups 


To(N) := {y €SLx(Z): y= (; ‘) (mod mh, 


(4.1.1) 
T1(N) := {y €SL)(Z): y= (( ) (mod | 
For T < SL»)(Z) a congruence subgroup, the quotient Y(T) := T\H can be 
compactified to X(I°) by adding finitely many cusps, identified with the orbits 
of T on P!(Q). As usual, we write X(N), X1(N) for the quotients X(C) with 
P=Po(N), TiC). 
Fork, N € Zs, a modular form of weight k and level N is a holomorphic function 
f: H — C that is bounded in vertical strips and satisfies 


az+b _ k 
f (=**) = (cz + d)* f(z) (4.1.2) 


for all y = (: i) € T\(N); the C-vector space of such forms is denoted 


cd 
M1 (N)). 
Modular forms are organized by character, as follows. The space M;(T\(N)) 
decomposes according to the action of diamond operators as 


Mk(Ti(N)) = DB Me(Po(W), x), (4.1.3) 
x 


the sum being over all Dirichlet characters x: Z/NZ — C of modulus N, where 
My(To(N), x) is the subspace of modular forms with (Nebentypus) character x 
consisting of those forms f satisfying 


az+b _ k 
f (=**) = x(d)(cz + d)* f (z) (4.1.4) 
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for all y € 'o(N). Throughout, we will abbreviate M;, (To(V), x) to Mx (N, x) and 
when x is trivial, write simply M;(N). 

In order to handle character values with some finesse (as explained above in 
Sect. 3 and below in Sect. 4.2), we work in the absolute situation (relative to Q) and 
consider the entire Galois orbit [x] of x, and so we write 


Mi(VoWN), [x)= QB Mx(To(N), x’), (4.1.5) 
x'elx] 


so that from (4.1.3) we have 


MkTi(N)) = GB Me(Po(N), [xD 
[x] 


where the direct sum is now over Galois orbits of characters [x]. We similarly 
abbreviate M; (To(NV), [x ]) to just Mx (N, [x]). 
Every such modular form f has a g-expansion (i.e., Fourier expansion at co) 


[ee 


f (2) =o ang” € Cll (4.1.6) 


n=0 


where g = exp(2miz) and z € H. We call a, € C the coefficients of f, and we write 
Zl{an}n] for the coefficient ring and Q({an},,) for the coefficient field of f, the subring 
and subfield of C generated by its coefficients, respectively. 

A modular form f is a cusp form if f vanishes at the cusps of X;(N). The 
subspace of cusp forms is denoted S,(T1(NV)) GC Mz(11CN)), and similarly 
Sk(To(V), x) © Mi(Vo(N), x). In particular, a cusp form vanishes at the cusp 
oo, so that the coefficient ap of its g-expansion is zero. 

The Petersson inner product provides an orthogonal decomposition 


Mx(To(N), x) = Sk(Po(N), x) @ Exo), x) (4.1.7) 


where Ex(Io(N), x) is the space spanned by Eisenstein series, obtained in an 
explicit way using characters (see Sect. 4.4). Each of the spaces above can further 
be decomposed into old and new subspaces, and we denote the new subspace by 
Se (11 (N)), ete. 

The above spaces can be equipped with an action of Hecke operators T,, indexed 
by n € Zs . The operators T,, are normal and pairwise commute for gcd(n, N) = 
1, so there is a common normalized (a; = 1) basis for the action of the Hecke 
operators, called eigenforms; for such forms, T, f = an f for f as in (4.1.6). A new 
cuspidal eigenform is called an (embedded) newform. The coefficients of a newform 
are algebraic integers and the coefficient field is a number field. When x is trivial, 
this coefficient field is totally real. When x is trivial, we also have Atkin—Lehner 
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involutions W, for p | N, and the Fricke involution Wy := II pIN W>. (See Sect. 8.3 
below.) 

For a subring A C C, we write M;(T1(N); A) C My(T,(N)) for the A- 
submodule of modular forms whose qg-expansions have coefficients in A, and 
similarly with the other decorated spaces. 

From now on, we suppose we are given the input of a weight k € Zs, a level 
N € Zs}, and an orbit of Dirichlet characters x of modulus N and orbit label N.s; 
we encode this data of a space of modular forms in the label N.k.s. 


Example 4.1.8 For N = 280, k = 2, and trivial character x having label 280.a, the 
space M2(280) = M2(T0(280)) has label 280.2.a. 


Remark 4.1.9 We restrict ourselves to integral weight forms in this article. For 
forms of half-integral weight, the algorithms, applications, and issues that arise are 
quite different. 


4.2 Galois Digression 


As is usual in Galois theory, it is convenient to work both with abstract objects as 
well as embedded objects. To this end, we call the Aut(C)-orbit of an embedded 
newform f a newform orbit, and write [f] for this orbit. We call a Q-subspace of 
aaa Uh o(N), [x]; Q) that is irreducible under the action of the Hecke operators a 
newform subspace. 

For an eigenform /f in a newform subspace, we obtain an embedded 
newform by a choice of embedding of its coefficient field into C, and all 
such embeddings are conjugate under Aut(C). Conversely, given an embedded 
newform f € S?(To(N), x), the C-subspace of Sx(Fo(N), [x]) spanned by 
o(f) := ¥,o(an(f))q" for o € Aut(C) descends to a newform subspace 
Ve S Sk(Vo(N), [x]; Q), visibly depending only on the Aut(C)-orbit of f. In other 
words, there is a bijection between newform subspaces V C S/*“(I')(N)) and 
newform orbits [ f] of embedded newforms f of weight k and level N. 

The coefficient field K of a newform subspace, defined to be the coefficient field 
of any eigenform in the subspace, is well-defined as an abstract number field. The 
expansion (4.1.6) considered in K, is similarly well-defined. 


4.3 Dimensions 


The first thing one may ask to compute for a space of modular forms is just dimen- 
sions of the subspaces as defined above: the total dimension dime M;(To(N), [x]), 
the dimension of the Eisenstein subspace dime E;(Io(N), [x ]), and the dimension 
of the cuspidal subspace dime Sx (9 (NV), [x ]), as well as the old and new subspaces 
of each of these. Since these subspaces are naturally vector spaces over Q, we have 
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dimc Mx(Po(N), [x]) = dimg Mx (To), [x]; Q): 


moreover, an individual space M;([o(N), x) is a vector space over Q(x) and each 
summand in (4.1.5) has the same dimension, so these absolute dimensions are the 
product of their relative dimension by the degree d = [Q(x) : Q] of x, i-e., we also 
have 


dimc Mx(To(N), [x]) = dimayx) Me (To(N), x; Q). 


Remark 4.3.1 To avoid errors, to compare across packages, and to store data 
conveniently, we found it essential to compute in the absolute setting (over Q) rather 
than the relative setting (over Q(x)). 


For weight k > 2, these dimensions can be computed using the valence formula, 
the Riemann—Roch theorem, or the trace formula—they are given explicitly e.g. 
by Cohen-Strémberg [25, Theorem 7.4.1]. Unfortunately, no formula is known for 
these dimensions when k = 1. 

Because they can be understood explicitly in terms of Dirichlet characters, there 
are separately given formulas for the Eisenstein dimension as well as the dimension 
of the new and old subspaces in all weights k > 1: see Cohen—Strémberg [25, 
Propositions 8.5.15 and 8.5.21] for the full dimension, with the new dimension 
worked out by Buzzard [17] using a formula of Cohen—Oesterlé [24, Theorem 1] 
as follows. Lacking a reference for these formulas, we record them here. 

For r,s, p € Z with p prime andr > O ands < r, define 


p’t+p", if2s<r=2r’; 


AGS, p) = 4 2p" ; if2s<r=2r' +1; (4.3.2) 
2p, if2s>r; 

and 

2 r=Sy, 

2p—4 if2s >rand jr=s5+1; 

2(p — 1)?p" 5? re s+ 2; 

0 p=2; 
Anew(1, 8, P) = 4 p—3 if 2s =rand }r = 2 and p > 3; 

(p — 2)(p— Ip? r>4; 

0 2¢r; 

p-—2 if2s <rand jr =2; 

@ ip? r>4and2|r. 


(4.3.3) 
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Theorem 4.3.4 (Cohen-Oesterlé, Buzzard) Let N,k € Zz and let x bea 
character of modulus N and conductor M | N. Then the following statements 
hold: 


(a) If x(-1) 4 (— 1, then dime Ex(N, x) = dime ER" (N, x) = 0. 
(b) For N = 1, we have 


1, ifk>4and2\|k; 


dimc Ex (1) = dimc E71) = (4.3.5) 
0, otherwise. 
Suppose further that N > 2 and x(—1) = (—1)*, and let 
e:= | | A(ord,(), ord, (M), p) 
PIN 
(4.3.6) 
€new *= I] Anew (Ord y(N), ord, (M), p). 
p|N 
Then the following hold: 
(c) We have 
e—1 ifk =2and x is trivial; 
dimc Ex(N, x) = 4 e/2 ifk =1; (4.3.7) 
e otherwise. 
(d) We have 


Cnew t1  ifk =2 and x is trivial and N is prime; 
dime ER"(N,X) = }enew/2 fk = 1: 


enew otherwise. 
(4.3.8) 
(ce) We have 


dime Ex(N,[x]) = ddimc Ex(N, x) 


where d = [Q(x) : Q] is the degree of x, and similarly with dimc E?““ (N, [x]). 


Proof The proof is an elaborate and rather tedious exercise in counting characters 
using the trace formula. Oo 
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Table 1 Dimensions for Total |New | Old 


pubspacesat MoO TD Modular forms |408 196 | 312 


Cusp forms 360 | 96 264 
Eisenstein series | 48 |0 48 


Example 4.3.9 We consider the space M3(560, [x ]) with label 560.3.bt; a character 
x in this orbit has label 560.bt, order 6, and degree 2. We then compute dimensions 
as in Table 1. 


One can also ask for the full trace form 


Yo Te(Tn | SKN, LxD)9" € SKN, Lx: Z) (4.3.10) 


n=1 


on S;(N, [x]) to some (q-adic) precision, with analogous definitions for the other 
subspaces considered above; see also (4.5.2) below. 


4.4 Eisenstein Series 


Beyond dimensions, we may next ask for further information about the decom- 
position of the space M;(N, x). Of course the first step is the decomposition of 
the Eisenstein subspaces Ex (N, x)—for this purpose, explicit bases are given by 
Cohen-Strémberg [25, Theorems 8.5.17, 8.5.22, and 8.5.23]. 


Remark 4.4.1 We do not currently display an Eisenstein basis in the LMFDB. 


4.5 Decomposition of Newspaces into Hecke Orbits 


With the Eisenstein subspace described explicitly above, we now turn to the cuspidal 
subspace. By the newform theory of Atkin—Lehner [1] and Li [59], the multiplicity 
of the space Se*(M, xm) in SCN, x), is equal to the number of divisors of 
N/M (so depends only on the conductor and level). While it suffices to study 
the new subspace, it may be computationally expensive to determine S?°“(N, x) 
as a subspace of S;(N, x); one way to do this is via projection operators called 
degeneracy maps, one for each prime divisor of NV. 

At this stage, for each newspace Sew (N ,[x]) we may first ask for just the 
dimensions of its newform subspaces V or Hecke orbits—see Sect. 8.5 below for 
a discussion of decomposition and irreducibility. When x is trivial, we may also 
ask for the decomposition of the space under the Atkin—Lehner involutions and the 
Fricke involution. 
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Table 2 Dimensions for 3 | 17 | 61 | Fricke | Dimension | Decomposition 
subspaces of S3°” (3111) j B 1424349 

29 29 

24 3421 

14 14 

24 24 

14 14 

13 13 

ae ey el 28 28 


Example 4.5.1 The space S}°” (3111), with trivial character, has dimension 159; it 
decomposes into newspaces of dimensions 1+ 2+3+3+7+13+ 144+ 14421+ 
24+ 28+ 29 = 159, as can be seen in Table 2, which also shows the decomposition 
into subspaces under the Atkin—Lehner operators. 


In practice, one computes this decomposition as follows. We first compute a 
Q(x)-basis for Sp (N , X) in some manner, and then we compute the matrix of 
T, on this basis for p { N for a few primes p in such a way that a small (finite) Z- 
linear combination >> p&p! p has squarefree characteristic polynomial. Therefore, 
the Q-dimension decomposition is simply the degrees of the irreducible factors each 
multiplied by [Q(x) : Q]. (There seems to be no problem in practice finding such a 
small linear combination, but the best thing that we can say rigorously involves the 
Sturm bound and appears to be far from optimal.) Already at this point engineering 
concerns enter: for example, the time to compute such a characteristic polynomial 
may be faster in certain implementations if done over Q instead. 

With this basic decomposition data in hand, we may continue. For each newform 
orbit [f] < V (cf. Sect. 4.2) we wish to compute the following: 


(1) The trace form 


Tr(f)(q) = > Trejo(an(f))q” € Sk(N, Lx: Z) (4.5.2) 


n=1 


(well-defined on the Galois orbit [f]), where K is the coefficient field of f, 
to precision n up to the Sturm bound (see Sect. 8.2). Equivalently, writing 
Tr(f)(q) = mq" € ZIlg]l, we have t, = Tr(T, | V) as the trace of the 
Hecke operator T,, restricted to V—see Sect. 8.6 for further discussion. 

(2) A minimal polynomial for the coefficient field K of [f]. 

(3) A finite set of generators for the Hecke kernel for V, the ideal in the Hecke 
algebra on S/'°“(N, x) that vanishes on V; i.e., a finite set of polynomials in T,, 
such that the ideal generated by these polynomials cuts out exactly V. (We use 
the Hecke kernel when computing inner twists: see Sect. 11.) 


Although it is possible to compute coefficients of the trace form Tr(f) by 
computing coefficients of f and taking traces, this is more expensive than other 
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techniques and is not computationally feasible in many cases where it is feasible 
to compute the trace form (e.g., using the trace formula: see Sect.5.2). The trace 
form conveniently records interesting information about the newform orbit, e.g., the 
coefficient ¢; of the trace form is equal to the dimension of the newform subspace. 


Example 4.5.3 Consider the space S2(1166, [x ]) with label 1166.2.c, the character 
having order 2 and conductor 53 | 1166. The old subspace decomposes as 


S2!4(1166, [x]) ~ S8°* (53, [x1)®* @ S3°¥ (106, [x])? @ Sh (583, [x])®. 


The decomposition of the new space S5* (1166, [x]) into irreducibles by Q- 
dimension is 46 = 2 + 22 + 22, giving rise to three newform orbits 1166.2.c.a, 
1166.2.c.b, 1166.2.c.c with respective trace forms 


Tr( fa)(q) = 2g — 2q4 + 2q° — 8q7 + 4g? + O("*) 
Tr(fo)(q) = 22q — 22q* — 6q° — 24q° + O(q!) (4.5.4) 


Tr(fe)(q) = 22g — 22q* + 4q° + 8g’ — 34g? + O(g"). 


We computed the last two trace forms without computing coefficients of a con- 
stituent newform belonging to a number field of degree 22, or even determining 
what this number field is, which would have been much more time consuming. 
For the newform orbit [f,], we determined that its coefficient field is Q(/—]), 
that it can be constructed as the kernel of the linear operator Te + 1 acting on 
Nee @ 166, [x ]), and then computed the first 1000 coefficients a, of its g-expansion 


> ang” as elements of Q(./—1). 


4.6 Hecke Eigenvalues 


Finally, for a newform f, we can ask for the coefficients of f up to (at least) the 
Sturm bound. These coefficients can be represented either exactly or as complex 
numbers (approximately, e.g. using interval arithmetic). 


e For exact coefficients, there are issues in representing them compactly: see 
Sect. 8.7 for our approaches. 

e For the numerical (complex) coefficients a,, the most useful for computing L- 
functions (see the next section), we ask for these coefficients for each embedded 
form in the newspace. These coefficients are of size O(n*—Y/2+) for alle > 0, 
so in large weight we prefer to compute the normalized coefficients a, /n‘*—))/?, 
which by the Ramanujan—Petersson bounds have absolute value of size O(n‘). 


For large degree coefficient fields, it is often practical to compute numerical 
coefficients even when storing exact coefficients would be impractical. 
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Table 3 Embedded newforms for 5355.2.a.bf 


Label | tm (v) a2 a3 |\a4 as a6 |\a7 ag 

14 2.65109 —1.65109 |0 | 0.726109 1.00000 | 0 1.00000 | 2.10331 
1.2 —0.273891 | 1.27389 0 | —0.377203 | 1.00000 | 0 1.00000 | —3.02830 
1a —1.37720 | 2.37720 0 | 3.65109 1.00000 | 0 1.00000 | 3.92498 


Finally, when the character is trivial, we can also ask for the signs of the Atkin— 
Lehner involutions (as can be seen in Table 2, for example). 


Example 4.6.1 Consider the newform orbit 5355.2.a.bf of dimension 3, with 
coefficient field Q(v) (LMFDB label 3.3.169.1) where v is a root of the polynomial 
x? — x* — 4x — 1. The g-expansion of a newform f in this orbit, with coefficients 


in Q(v), is 
fq) =4+(— iq? + (2- Bi + B2)q4 +9? +4’ + (2— Bi +2f2)9q8 + O(g"”) 


where f; = v and B2 = v* — v —3. 

The 3 embedded newforms are labeled 5355.2.a.bf.1.m form = 1, 2, 3 encoding 
the three embeddings t,, : Q(v) — C; the embedded coefficients to 6 decimal digits 
are shown in Table 3. 


4.7 L-Functions 


We can also ask for computations related to (invariants of) L-functions of modular 
forms, including the sign of the functional equation, the first few zeros, and special 
values to some precision: see Sect. 9 for more detail. 


5 Algorithms 


In this section, we give a brief overview of different algorithmic methods to 
compute modular forms and indicate where they are currently implemented. In our 
computations for the LMFDB, we only used the first two (modular symbols and the 
trace formula), but here we also survey the others. Our goal is to give a flavor of what 
each method entails, referring to the references provided for details. Throughout, we 
keep notation from the previous section. 
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5.1 Modular Symbols 


The most well-known method to compute modular forms is the method of modular 
symbols, introduced by Birch [6] and developed by Manin [61], Merel [64], Stein 
[89], and many others. For an extensive history, see Stein [89, 8.10.2], and for a 
gentle overview see Stein [90]. This method was implemented in Magma [11] by 
William Stein, with contributions by Steve Donnelly and Mark Watkins, and in 
SageMath [79] by William Stein, with contributions by David Loeffler, Craig Citro, 
Peter Bruin, Frédéric Chapoton, Alex Ghitza, and many others. 

We now briefly introduce modular symbols. Assume k > 2. Integration gives a 
perfect pairing 


Sk (N)) x Ai (X(N), REx, yle-2) > C 


(5.1.1) 
(f,v®P)we i) f@) P«, I dz 


where R[x, y]x—2 denotes the R-vector space of homogeneous polynomials of 
degree k — 2. In a slogan, (5.1.1) indicates that the homology of a modular curve is 
dual to its cusp forms, and this is formalized as follows. Let Div(P! (Q)) be the free 
abelian group on symbols [a] for a € P!(Q), and let Div?(P'(Q)) < Div(P!(Q)) 
be the subgroup of degree zero elements under the natural degree map. Then 
Div? (P! (Q)) is generated by elements {a, 8} := [a]—[f] for a, B € P'(Q), written 
this way to suggest a path from a@ to B in C. We define the space of modular symbols 
of weight k and level N (with Q-coefficients) to be the quotient 


QLx, ylk-2 ® Div? (P!Q)) 


ModS I (N); = 
RSE eS ee B= 9 OP Oe, Bin sealer 


under the natural action of [)(N) < SL2(Q). The space ModSym, (11 (NV); Q) 
of modular symbols has moreover a natural action of Hecke operators and Atkin— 
Lehner operators. 


Theorem 5.1.2 There is a Hecke-equivariant isomorphism 

ModSym, (11 (N); Q) > MeCN); Q) ® Sei); Q) 
where Sx(T\(N); Q) denotes the space of anti-holomorphic cusp forms, the image 
of SK(11(n); Q) under complex conjugation. 


Proof See Manin [61], Merel [64], or Stein [89, $8.5]. oO 


Theorem 5.1.2 has many variants: one may restrict to [9(V), work with the 
(appropriately defined) space of cuspidal modular symbols as the kernel of a certain 
boundary map, carve out just My (1, (NV); Q) as the +-space for a natural action of 
complex conjugation, and so on. 


Computing Classical Modular Forms 149 


Example 5.1.3. For '9(V), the space of modular symbols has a convenient descrip- 
tion in terms of Manin symbols as follows: ModSym, (Mo(N); Q) is the Q-vector 
space generated by the set A of elements 6 = Ga (c : d)) fori = 
0,...,k —2and (c: d) € P'(Z/NZ), modulo the subspace 


(6+ 58,6 +5R+5R7)5ca 


where S = ( y s) and R = ( ¥ ) The Hecke operators do not preserve 


individual Manin symbols, but there is an efficient procedure (arising from the 
Euclidean algorithm) for reducing an arbitrary element of ModSym,(To(V); Q) 
to a linear combination of Manin symbols. 


One feature of modular symbols is that they are especially well-suited for certain 
applications, including arithmetic invariants of elliptic curve quotients [31] (and 
more generally modular abelian varieties) as well as L-values of modular forms 
(see e.g. Sect.9.4 below for an application). Moreover, modular symbols can be 
employed for arbitrary congruence subgroups (see [3] for an example). 

In practice, it is quite efficient to compute the space of modular symbols with its 
Hecke action. It is a matter of sparse linear algebra to compute a basis of modular 
symbols, a negligible contribution. The number of field operations to compute the 
action of the Hecke operator 7, on this basis is O (nd), where d is the Q(x)- 
dimension of the space under consideration: for each of the d basis elements, we 
sum the action of o1(n) := > abe d=O (n) cosets and reduce to the basis in time 
polynomial in logn using continued fractions. In this way, we may compute the q- 
expansions of a basis to precision O(q’) using O (dr?) field operations, and thereby 
also the trace form. 

The most difficult engineering effort that goes into a working implementation 
of modular symbols is the careful handling of linear algebra aspects: we apply 
degeneracy operators to obtain precisely the subspace S;““(N, x), and once the 
matrices [7,,] representing the Hecke operators are computed on this space, we 
compute its decomposition into newform subspaces, etc. Indeed, in the preceding 
paragraphs, the actual time complexity of this method may depend on the output 
desired and the meaning of “arithmetic operation”. If we wish for exact results, 
which is the approach taken by Magma and Pari/GP, then we need to do exact 
arithmetic with elements of cyclotomic fields, and the larger the order of the 
corresponding Dirichlet character, the more expensive the computation. Similarly, 
the coefficients of the newforms themselves may live in a large extension of the field 
of character values, and the larger this extension is, the harder the computation. 


Remark 5.1.4 As alternatives, we may do all of the computations described using 
floating point approximations to complex numbers, for example using complex ball 
arithmetic to compute rigorous error bounds for all of the output. In this case, 
the degree of the field of coefficients of the modular form is irrelevant, and the 
time complexity matches the estimates above; this is particularly attractive if our 
application is to the computation of Dirichlet coefficients for input into L-function 
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computations. Similar comments apply when performing computations over a finite 
field, for example, working with coefficients over a finite field with prime cardinality 
congruent to 1 modulo the order of x—in this case, we can do all computations 
over F,. In both cases, we must do some reconstruction to obtain exact results in 
characteristic zero. 


The above description requires weight k > 2. For weight 1, there are two 
approaches that reduce the problem to higher weight. In the approach originated by 
Buhler [14], further developed by Buzzard [18], and carried out to scale by Buzzard— 
Lauder [19], we choose nonzero f € M;(T1(N)) and consider S)(T1(N)) C 
f7'Myi1(11(N)). Intersecting the spaces obtained for many choices of f, we 
quickly obtain an upper bound for the space $;(I"1(V)) that can then be matched 
with a lower bound. Using Buzzard’s code, this method was implemented in Magma 
by Steve Donnelly. (Currently, Magma can provide a basis for the cuspidal subspace, 
but it does not decompose the space into the old and new subspace and does not 
provide the action of the Hecke operators; this was implemented by Buzzard— 
Lauder, but has not yet been incorporated into Magma.) A second related approach 
is to use the Hecke stability method of Schaeffer [80], instead computing the largest 
subspace of f —1 My 1(7,(N)) that is stable under the Hecke operators; this has 
been implemented in SageMath by Schaeffer and Loeffler, and in Pari/GP by Belabas 
and Cohen [4, §4]. 


5.2 Trace Formula 


Perhaps the earliest method to compute modular forms used the trace formula. The 
trace formula is an explicit formula for the trace of a Hecke operator acting on a 
space of modular forms, and it was pioneered by Selberg [81] and later developed 
by Eichler [40], Hijikata [47], and Cohen—Oesterlé [24]. A comprehensive treatment 
with references is the book of Knightly—Li [58], and a tidy presentation is given 
by Schoof—van der Vlugt [85, Theorem 2.2]. Proofs of the trace formula from 
different vantage points continue to be developed, see e.g. Popa [74]. This method 
has been implemented in Pari/GP [72] by Belabas—Cohen [4] and in a standalone 
implementation by Bober, described in Sect. 7.2. 

We again assume k > 2. An explicit version of the trace formula for 
Tr(Tn | Sk To(V), x)) € Q(X) is too complicated to give here. Aside from easily 
computed terms, it can be naively understood as a weighted sum of (Hurwitz) class 
numbers of imaginary quadratic fields: for a precise statement, see e.g. Belabas— 
Cohen [4, Theorem 4]. We obtain Tr(T;, | SP (N , X)) from a nontrivial application 
of the Mébius inversion formula, proven in Corollary 6.1.5 below. 

Letd := dimgy,) S?(N, x) = O(KN). The computation of Tr(T;, | S?° (N, x)) 
requires computing class numbers of imaginary quadratic fields with absolute 
discriminant up to O(n), and one can compute all of these at once in time 
complexity O(n3/2). For the purposes of a large-scale computation, these class 
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numbers are cached and may be assumed to be precomputed (their cost amortized 
over many computations, thereby negligible). Under this assumption, and given 
factorizations of n and N, to compute Tr(7;, | Sp (N ,X)) we sum O(,/n) terms 
giving a complexity of O(./nN‘*) field operations for any « > 0; computing all 
traces up ton > d then takes O(n3/ *) field operations. 

In this manner, we compute the relative trace form on the new cuspidal subspace 


(oe) 


1(q) = D> Tr(Tn | SP CN, x))q" € SEWN, x3 ZLxD), (5.2.1) 


n=1 


and from this we quickly compute the full trace form (4.3.10) in Se 1(N); Z). 
In particular, using the trace formula method we can compute either trace form 
to precision O(q’) using O(r3/2N¢£) field operations, which for r > d becomes 
O(r3/) as in the previous paragraph. 

By multiplicity one theorems, and since the Hecke operators act semisimply 
on the newspace, the images of f under the Hecke operators span S7°(N, x). 
Explicitly, applying 7, to t, we obtain 


Cig =>— nT, (Se Na" (5.2.2) 


n=1 


and the forms Tit, 72t,... span S?°*(N, x). (We recall that TT, = Tnn when 
gcd(m,n) = 1, and more generally a recursion for the Hecke operators applies. 
Therefore, these coefficients can again be expressed in terms of traces of Hecke 
operators.) Once we have a spanning set, we can extract a basis and apply Hecke 
operators to that basis. 

Typically (in practice) we need O(d) forms to span and O(d) coefficients of 
each form to get a full rank matrix. Thus writing down a basis typically requires 
the first O(d) values of Tr(Th | Sew (N, x)), which can be computed using O(d?) 
field operations. Finding this basis—and the g-expansion to precision O (d) for each 
form—is standard linear algebra, accomplished using O(d) field operations. To 
compute the matrix of the Hecke operator J; on this basis requires traces up to 
O(nd) and so O (n3/*d3/*) operations. Finally and similarly, to compute a basis of 
q-expansions to precision O(q’) with r > d, we compute traces up to O(rd) and 
apply a change of basis, for a total of O(a! ?,3/) arithmetic operations. 

We summarize the estimated complexity of these two approaches in Table 4, 
where again d is the Q(x)-dimension of the space under consideration and we 
suppose precision r > d. 

So although linear algebra eventually dominates both approaches, neither mod- 
ular symbols nor the trace formula seems to be a winner for all tasks: it seems to 
be much better to use modular symbols to get information about a small number 
of Hecke operators, while it is much better to use the trace formula to get a large 
number of coefficients of a basis of newforms. This heuristic analysis matches our 
practical experience in the course of our computations. 
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Table 4 Heuristic complexity of modular form computations 


Task Modular symbols | Trace formula 
Full trace form to precision O(q'),d= O(r) O(dr?) O(r3/2) 

[T,] on a basis O(dn) O(d3/2n3/2 +d?) 
Characteristic polynomial of 7, on a basis O (dn + d3) O (d3/?n3/? + d3) 
Basis of g-expansions to precision O(q"), d = O(r) O(dr?) 0(d3/273/2) 
Hecke decomposition O (d3) O (d3) 

Minimal polynomials for newspace coefficient fields O(d3) O(d) 


Similar comments with reference to weight | forms apply as in the previous 
section. The same is true for the issue of time complexity and the coefficient 
field (see e.g. Remark 5.1.4), with the caveat that the matrices representing Hecke 
operators using modular symbols tend to be much sparser in comparison to those 
using the trace formula. In particular, one expects that taking advantage of sparsity 
will allow a more efficient implementation of the linear algebra aspects for modular 
symbols. 


Remark 5.2.3 In some circumstances, it can be more convenient to work with a 
basis that is in echelon form with respect to g-expansions (sometimes called a Victor 
Miller basis) in the trace formula method. With such a basis, going back and forth 
between an action on g-expansions and the matrix form for various linear operators 
one can see some gains in efficiency. 


5.3 Definite Methods 


In both of the previous algorithms, we work (either explicitly or implicitly) on the 
modular curve. In this section, we indicate another class of algorithms that compute 
systems of Hecke eigenvalues using a different underlying object. 

Going back at least to Jacobi, surely the first modular forms studied were theta 
series. Let O(x) = Q(x1,...,%q@) € Z[x1,..., xa] be a positive definite integral 
quadratic form ind = 2k € 2Z;, variables with discriminant N. Let P(x) be a 
(nonzero) spherical polynomial of degree m > 0 with respect to Q, for example 
P(x) = 1. We form the generating series for representations of n € Zso by Q 
weighted by P, a theta series of Q, by 


60,P(q) = D> P(x)q2™. (5.3.1) 
xeZd 
For example, if P(x) = 1, then 


[ee 


60,1(q) = > ro(n)q” € ZIIqll (5.3.2) 
n=0 
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where rg(n) = #{x € UF s Q(x) = n} counts the number of representations of n 
by Q. By letting gq = e*”% for z € Has usual, we obtain a holomorphic function 
69: H — C-. Further, by an application of the Poisson summation formula (see 
e.g. Miyake [67, Corollary 4.9.5]), we find that 6p,g € Mkim(To(2N), xn) is a 


N 
classical modular form of weight k + m, level 2N, and character yy (a) := (= 
a 


of order at most 2. 

Turning this around, we can use theta series to compute spaces of classical 
modular forms. Perhaps the most convenient source of such theta series is to work 
with quaternary (d = 4) quadratic forms of square discriminant coming from 
quaternion algebras—this method goes by the name Brandt matrices as it came 
about from early work of Brandt. Building on work of Eichler [41], Hijikata—Pizer— 
Shemanske [48] proved that linear combinations of such theta series span the space 
of cusp forms, up to twists. (See also Martin [62] for a more recent development.) 
The coefficients of theta series can then be reformulated in terms of classes of 
right ideals of specified reduced norm in a quaternion order. This method was 
first developed in an algorithmic context by Pizer [73]; it has been implemented 
in Magma by David Kohel and in SageMath by Bober, Alia Hamieh, Victoria de 
Quehen, William Stein, and Gonzalo Tornaria. 

In a little more detail, the method of Brandt matrices runs as follows. Let B be 
a definite quaternion algebra of discriminant D := disc B, a squarefree product 
of the primes that ramify in B. Let O C B be an Eichler order of level M with 
gcd(D, M) = 1, and let N := DM. Let ClsO be the set of locally principal 
(equivalently, invertible) fractional right O-ideals up to isomorphism (given by left 
multiplication by an element of B*). Then Cls QO is a finite set, so let Cls3O = 
{(11], Lol, ..., Wn)} with h := #Cls O. Let OL (/;) be the left order of J;, and let 
Ty := O,(/i)* be its unit group with #1; < oo. Let gj := nrd(/;). Forn € Zs, 
define 


O(n)i,; = Ti\{a € 117! : nrd(a)qiq; | =n}. 


We have a € ©@(n);,; if and only if a/; € J; with index n2. To connect this with the 
previous paragraph, we have 


Qi: 11," > Z 


Qj(w) = nrd(a) (5.3.3) 
qj 


is a positive definite integral quaternary quadratic form of discriminant N* whose 
theta series descends to a modular form of level N—in the notation above, we have 
"Oi; (n) = #O(n);,;. In this way, we can compute a matrix for the Hecke operator 
[T,] acting on S,([o(V), x) by quaternionic arithmetic: for weight k = 2, the 
matrix [T,,] is the adjacency matrix of the directed graph with vertex set Cls O and 
directed edges between [J;] and [/;] with multiplicity #0 (7); ;. 
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The method of Brandt matrices has several advantages. First, the forms computed 
this way are necessarily new at all primes p | D, so linear algebra with degeneracy 
operators can be minimized. Second, the matrices [7;,] of Hecke operators are 
sparse: for example, in weight k = 2 they have nonnegative integer coefficients 
whose columns sum to o(7). Accordingly, linear algebra steps have an improved 
complexity both in theory and in practice. Third, Brandt matrices also carry useful 
arithmetic information about the reduction of modular curves at primes of bad 
reduction. Fourth, the set ©(n);,; is independent of the weight k and so may be 
reused. Despite these advantages, the main limitation of Brandt matrices seems to 
be that it works most simply when there exists a prime p that exactly divides the 
level N (so that an Eichler order of reduced discriminant N exists); otherwise, we 
must work with non-Eichler orders. Hence current implementations focus on this 
case. 

The Brandt graph is an expander graph by the Ramanujan—Peterson bound, so 
with short vector computations one can compute a set of representatives for Cls O 
and a spanning set for S,(To(V), x) using O(h?) operations; computing a basis 
from this is a matter of sparse linear algebra and can be considered to be negligible. 
To compute a single matrix [T,,], in principle we could use Minkowski reduction 
(together with some awkward corner cases) on ho (n) right ideals using O (hn) = 
O (dn) operations. To compute a basis of g-expansions to precision O(q’) with 
d = O(r), for each of the h classes we can enumerate elements of small reduced 
norm using the Fincke—Pohst algorithm in time proportional to the volume so 
O (dr*), performing reduction with the same complexity. These heuristics match the 
running time of modular symbols with linear algebra again eventually dominating— 
however, it is here where sparse linear algebra may ultimately in practice give the 
Brandt matrix method an edge. 

A method that shares much in common with Brandt matrices is the method of 
graphs due to Mestre [65] and Oesterlé. We suppose that p || N and work with the 
quaternion algebra B of discriminant D = p. We recall that there is an equivalence 
of categories between supersingular elliptic curves over an algebraic closure of Fp 
under isogenies and invertible right (or left) O-modules under homomorphisms. So 
to compute a matrix for the Hecke operator, in place of Cls O we can compute the 
set of isomorphism classes of pairs (E, C) where E is a supersingular elliptic curve 
in characteristic p and C is a cyclic subgroup of order M = N/p, and in place 
of the sets @(n);; we can enumerate cyclic isogenies between these points up to a 
natural equivalence. 

Finally, a related method of Birch [7] (who sought to generalize the method of 
graphs beyond discriminant D = p) uses ternary quadratic forms instead. This 
method captures all of the advantages above, with an additional feature: work in 
progress by Hein—Tornarfa—Voight shows that one can carve out not just a new 
subspace but moreover one can specify the Atkin—Lehner eigenvalue, reducing the 
total dimension and thereby the complexity of linear algebra operations. 
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5.4 Other Methods 


We conclude by briefly indicating two other methods in addition to the above. 


¢ Multiplying forms of lower weight. We compute a presentation for the graded 
ring of modular forms of level NV 


M(T1(N)) = GB MP1 (N)) 


k=0 


(or the same for '9(V)) in terms of a finite set of generators and a Grébner basis 
for the ideal of relations among them; see work of Voight—Zureick-Brown [96] 
for an explicit description of this graded ring in terms of the genus and number of 
cusps for I’; (4) (and more generally in terms of the signature of the uniformizing 
Fuchsian group) as well as further references and discussion. From this, one can 
compute for each weight k a set of (leading) monomials in the generators that 
are a Q-basis for M;,(I'|(NV)). Using fast Fourier techniques, the multiplication 
of these g-expansions allows the computation of a basis for large weights k (and 
fixed level NV) quite efficiently in comparison to any of the approaches above. 

¢ Polynomial-time algorithms. By work of Edixhoven—Couveignes [39], Bruin 
[12], and Mascot [63], one can compute coefficients of modular forms of level 
1 in polynomial time: for example, for the modular discriminant A(q) = 
yy, Tq” € Si2(1), the value t(p) for a prime p can be computed in time 
bounded by a fixed power of log p. 


6 Two Technical Ingredients 


In this section, we consider two technical results that are needed in the above 
algorithmic description. 


6.1 KEichler-Selberg Trace Formula for Newforms 


We first prove a technical result that is used by Belabas—Cohen [4] in the computa- 
tion of modular forms in Pari/GP [72], as explained above: we describe the trace of 
Hecke operators on the new subspace in terms of the trace on the total space. 

Let x be a primitive character of conductor Q | N and k a positive integer 
satisfying x (—1) = (—1)*; we take these to be fixed and suppress their dependence 
from the notation. 

For any positive integer n, the nth Hecke operator T,: Sx(N, x) — Sx(N, x) 
may be defined by 
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1 az+b 
(MAQ=— DP x@at Yo (3 ) 


b mod d 


ad=n 
gcd(a,N)=1 
Then 


CO 


GiHoO= > Y x(d)d"'amp e(mz), 
m=1 \ d\(m,n) 
(d,N)=1 


where f(z) = )->-_, dme(mz). This operator stabilizes the subspace Spe (N, Xx). 
Let { Fv in , be a basis of normalized newforms for Seow (N , X) and write 


fj) = <> an, j(me(mz). 


m=1 
We assume that each fy, ; is an eigenfunction of T,, of eigenvalue ay, ;(n) and define 


SN [o,@) 


&n = Yoav, j) fv.) ce 3 e(mz) Tr (Tn in| sn, x))- 


j=l m=1 


We parameterize the basis of S,(N, x): for M1, M2 € Zs, with Q | M, and 
MM? | N, let 


fae j 2) = fom, j (Maz). 
Then 
thie j Mi, M2 € Zs1, | Mi, MiM2 | N} 
is a basis for S,(N, x). Let us extend the definition of ay, ;(n) to Qso by writing 


an,j(x) = Oifx ¢ Zs}. 
If gcd(n, N) = 1, then 


fay = a3 »D x(da"am,i( ara) elm) 


m=1 d|(m,n) 
(d,N)=1 


= 25 am, (ze -) am, j(n)e(mz) = amy j(™) fy j> 


m=1 
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so each fae 3 is an eigenfunction of 7,. To compute the action of 7, when 
gcd(n, N) > 1, we need the following theorem. 


Theorem 6.1.1 Let p | N be prime, let a € Zo, and let r := ordpy Mo. Let xm, be 
the character modulo M, induced from x. Then 


a fl ae ifa—r <0; 
M,j — = M> p-" wy Mp p7t! . 
" am,,j(P" "fu, = — Xm (P)P Ram," fu) far >. 


(6.1.2) 
Proof By the definition of the Hecke operator, we get 


Tye «fur j = > x(d)d*"am,,j (Go etme) = Yen, j ( a )ecmey, 


m=1d|(m, p oF 
(d,N)= 


Ifa —r <0, then 


M: m M2 p~* 
Tye Mise = 1D aM), j Gas ar | e(mz) = fu.j 


Assume that a —r > 0. Since fy,,; is a normalized newform for I9(M1), we 
get 


m = 
emi (Gap) tn 


aM,,j (4 if p | Mi, re 
=, i { dp ( ), = } — a—2e—r . ne 
pee eae G6 3) ume cs (a), if p{ My. 


Then, if p | M1, we have 


M. * Mop" 
Tye fu, j = am,,j(p)* -fup i 


We now assume that a —r > O and p{ Mj. Ifa —r = 1, we have 


mp” m 7 7 mp2 
am, j ( Ma ) = dM,,j (a=) -amy,j (P®") = Sora, im>1xX(P) PP am, j ( Ma 


By taking the summation over m € Z>1, we get: 


M2 p~ 


o a-l 
M if = mp 
Tye fg) = mj (P* fu, — xP) | Yo am; ( ) e(mpz) 


m=1 
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_ Mo p-" k-1 -M —r+l 
=am,,j(P" fu) —X(P)P fu 


Note that when r = 0 we have M, M2p | N, since p | N. 
Ifa —r—2 > 0, by changing a to a — 2, we get 


min{ord, (m),a—2—r}+1 


a—2e—r 
= m a—2-r - e mp 
x(p) p* ‘amy, (a=) aM, j (p 2 ) = » (x(p) p™ D) aM,,j (~_—) : 
By subtracting from (6.1.3), we get 
oz k-1 —r—2 m mp 
fam,.j(P" "Y= x(p) pe amy, (p"" ams (Gas) = am,,j ( Mb ) 
= (x(t iynren aMy,j aa) , if 0 <ord,(m) <a—2-,r, 
—(a—r) : 
te (x(p) p*- _ "aM, (4) if ord,(m) =>a—r, 
0, otherwise. 
After taking the summation over m € Z> on both sides, we get 
foe} 
2 aM, j es ~) e(mz) = Tee 
_ = 2 My p~" 
= fam,.j(P" ")— x(p)p*"am,,j(p* 5); iia 
a—2—r 90 mp*—2-"-£ 
+ ye (x(p) pk 1)! ps aM,,j (=~—) e(mp*z) 


£=0 m=1, 
ptm 


— (x(p)p* 1)" Yam, j (a=) e(mp*~"2). 


m=1 


For the last piece, we have 


pa (Gas =) emp" 2) = fee @). 


Now consider 


(oe) 


» aM,,j (7-5) e(mz) = feo "(2 - > am, j (Gres) e(mpz). 


m=1, 
ptm 
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Since 


am,,j(P)° i = Dawns (Gap 


we get 


=) etme + x(p) pho! pier" (ey, 


Mj 


diam Blea =) e(me) = fut (@)— ami) fans + xe" Fee 
ps 


Mj (2). 
For each 0 < £ < a —2-—r, we get 
eo a—2—r—£ 
mp e 
am ( = ) comp z) 
ho M2p" 
ptm 
—2-r— My p-"t! Mp ptt Mb pat t2+6 
= amy (pT) | ee) = aan (Pfu + xP tye @h. 
So we finally get 
Mort 
eae = {am j(P°") — x(p) Pam, j (p® >> | furs 
a—2-r 
a (x(p) pk) Ha aM,,j (ars) 
£=0 
M. +e Mp p7rtl+ Mp pa" t2+& 
Aer mI: EO 
M- —2r 
— (x(p) pe fee 


Mj 
For s € Zo, we have 


amy,j(P*) -a4m,,j(P) = 4m, (p*t!) + x(p) ps 'am,,j (p*—'), 


So we get 
aera +14e 
= ~2-r— Mop 
Yo yp) am,,j (>* oe ‘) amy,j(P)> furs 
l=0 


a—2—-r 


2 
Es M —r+l+e 
YOO yma err tia 
£=0 
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e—2—r 
a » (x(p) p*") am, j [eo oe 
(0 
a—1—r 
= © op )’am,, 9° > fen 
f=1 
> -r+e 
+ wp yams (et) ee 
t=l 


Then we have 


: ; - a = 
Ta funy j = fam,.j(P" = x(p) p* ‘am j(P* ° | fuj 


a 


=—2=r 
2 a Mp pt 

i y- (x(p) pk) a,j (p° 2-r ,) vie 

£=0 

ar +2 

= —r— Map" 

+ Dorp ''am.j (vp ‘) Mid 

£=2 

a@—l=r 


a=l=r 


S a M- —r+é 
— DY aw am, (p* es *\. fied 
l=1 


a = M a—2r 
=O lt 


= Mp p-" _ aes M.- —r+l 
=am,,j(P™ fy) — (x(P)P* Yam, iP) far 


Combining, we obtain (6.1.2). oO 


Forn, N € Zs, we write gcd(n, N°) for the largest positive integer d such that 
d|nandd | N¥ for somek € Z>1, i.€., 


edn n= |] pe, (6.1.4) 
plgcd(n,N) 


The following corollary is then immediate. 


Corollary 6.1.5 With notation as above, we have 


N/M k-1 new 
Tr(F, N, = d {| ———_—__ b)x (b)b’ Tr(T 1 M, ‘ 
(Tn | Sk(N, x)) ) (aaa) > ub) x (b) r( ts | Sp (M, x)) 
McZ>, b?|gcd(n, N©) 
M|N gcd(b,M)=1 


cond(x)|M 
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6.2 Certifying Generalized Eigenvalues 


Second, we show how to certify generalized eigenvalues. Consider the generalized 
eigensystem 


Ax = ABx, (6.2.1) 


where A and B are real symmetric n x n matrices, with B positive definite. Choosing 
R such that B = R? R and making the change of variables x = R~!y, this becomes 


Aly = ny, (6.2.2) 
where A’ = (R7')’AR7!. Note that A’ is again symmetric, so there is an 
orthonormal basis {y1,..., yn} with A’yj = djyj. We set xj = R-1y,;, so that 


the x; are orthonormal with respect to the inner product defined by B. 
Suppose that we have found approximate eigenvalues 1; and eigenvectors x, i.e. 
so that e; = (A — Aj; B)x; is small. Let 
n 
Xj = > CjkXk 


k=1 


be the expansion of x; in terms of the eigenbasis. For any ¢ > 0, define 


Vj.c = Span{xyz : |Ag — Ajl < e}, (6.2.3) 
and let 
HS YO opea (6.2.4) 
k 
|Ag—Ajl<e 


be the orthogonal projection (with respect to the inner product defined by B) of x; 
onto V;,-. Then we have 


n 
v; -Bvj. =X; BE; — Ss Coy = ET BE; — 07 DY cig (ae — 5)? 
{k:]ke—Aj |e} at 
= x, Bx —e 7 [(B'A —1,)x;1" BUB'A —A,)x;] 
= X} BE; - ee Ble; > ET BE; - eb" le;?, 
(6.2.5) 
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where b > 0 is the smallest eigenvalue of B. Note that this is positive if 


and thus V;,- is non-zero. Hence, there is an eigenvalue A, in the interval J; = 
[Aj — €j,A; + 6;]. 

Suppose that we are in the favorable situation that the J; are pairwise disjoint. 
Then our system has distinct eigenvalues, and we may assume without loss of 
generality that A; € J;. Next, let 6; = min{|A — ajl tA E Ung; Ik}, so that 
(Aj — bj, rj +6;) contains 4; and no other eigenvalues. Put A; = x; — Vj,d;> where 
Vj,¢ and vj, are as above, so that x; — A; is an eigenvector with eigenvalue A ;. To 
use this in practice, we bound the coordinates of A; from above and add them as 
small error intervals onto the coordinates of x;. (The resulting vector must then be 
renormalized in interval arithmetic, according to whatever convention we use, e.g. 
first Fourier coefficient 1.) To that end, we have 


n 
2 T 2 -2 2 ~ 9 
IRAP =ASBAj= DT chy $87" D ici x — Ay) 
{k:|An—A j|28;} k=1 
= 877[(B7'A —4,)%)" BBA — 1))5)] = 857e) Boe), 
(6.2.6) 


[Aj <8j! yor leh Bote; < it (62.7) 


J 


and thus 


Finally, to estimate b we first compute a double-precision approximation P to the 
orthogonal matrix which diagonalizes B. We then compute in interval arithmetic the 
matrices 


S= (sj) = P™BP and T= (jx) =P P. 
By Sylvester’s law of inertia, we have b > A for any A such that S — AT is positive 


definite. In turn, by the Gershgorin circle theorem, this holds if the diagonal entries 
sj; and t;; are strictly positive and 


A> A* := min 28jj — dui lSjxl = Dee Ise 


6.2.8 
j rata ee 


Hence b > A*. 
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7 A Sample of the Implementations 


7.1 Comparison of Methods 


In the course of our computations we made extensive use of the modular forms 
functionality included in both Pari/GP [72] and Magma [11]. In this section we 
compare the performance of the two implementations on a small but representative 
subset of the modular forms we computed: all newforms of weight k and level N 
with Nk* < 1000 and k > 1. We exclude the case k = 1 from this comparison 
because it is not fully supported in Magma and the algorithms used to compute 
weight one forms are substantially different. For modular forms of weight k > 1 
the Magma implementation is based on the modular symbols approach, while the 
Pari/GP implementation uses the explicit trace formula. 

For each level N in our chosen range we fix a representative Dirichlet character 
x for each Galois orbit [x] of modulus N, and for each newspace Nias (N, x) with 
k > 1 and Nk? < 2000 we carried out the following computations in both Pari/GP 
and Magma: 


(1) Determine the dimensions of the irreducible subspaces of Se (N »[x]) (the 
newform orbits). 

(2) For each newform orbit [ f], compute the first 1000 integer coefficients ¢, of the 
trace form Tr(f) = > ,5) nq”. 

(3) For each newform orbit [f] of (absolute) dimension d < 20, compute a 
(reasonably nice) defining polynomial for its coefficient field K and the first 
1000 algebraic integer coefficients a,(f) € K for a constituent newform f/f. 

(4) For each newform orbit [ f] of dimension d < 20, compute an LLL-optimized 
basis for its coefficient ring and express the first 1000 coefficients a, (f) in this 
basis. 


As can be seen in Tables 5 and 6, the explicit trace formula approach used by 
Pari/GP is faster overall than the modular symbol method implemented in Magma, 
especially for spaces that consists of a single Galois orbit, but for newspaces that 
split into multiple Galois orbits it is typically slower, and in general Magma is able 
to decompose newspaces into Galois orbits more quickly than Pari/GP. The large 
advantage Pari/GP has on irreducible spaces is due to the fact that in this situation 
we can use mfsplit to determine that the space is irreducible without actually 
computing any eigenforms, and then use mEt raceform to compute the trace form 
for the entire space. 

In Table 7 we list the 10 newspaces in our chosen range that were the 
computationally most difficult for either Magma or Pari/GP. In each case, the 10 
most time consuming newspaces accounted for approximately half of the total time 
to process the 7621 nonzero newspaces in our test range. 

Notably, only two newspaces (467.2.c and 497.2.c) were among the computa- 
tionally most difficult for both methods (these are the two newspaces of largest 
dimension in our chosen range). Most of the newspaces listed in Table 7 were 
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Table 5 Magma 2.24-7 vs. Pari/GP 2.12.1 (Intel Xeon W-2155 3.3 GHz); timings for newspaces S 
of level N > 1, weight k > 1 by Nk? range 


Split time (s) Total time (s) 
Nk? Num S | Num f | >> dim(S) Magma Pari/GP Magma Pari/GP 
[1, 200] 183 214 897 0.4 1.1 73.8 18.2 
[201, 400] 453 709 7560 3.5 17.2 302.4 116.6 
[401, 600] 574 1050 21452 22.2 50.2 643.4 220.1 
[601, 800] 677 1326 43 515 132.1 70.8 2444.8 300.6 


(801, 1000] 764 1542 71358 751.3 322.3 9216.4 728.2 
[1001, 1200] 879 1805 109 570 2653.1 1 253.3 36 940.0 2 347.6 
{1201, 1400] 905 | 2001 152 344 8 889.0 5517.0 | 161327.7 | 11855.8 
[1401, 1600] 995 | 2284 203492 | 24841.1 | 21256.5 | 349656.8 | 67233.4 
[1601, 1800] | 1032; 2420 264506 | 63476.2 | 59392.6 | 952669.0 | 194 405.8 
{1801, 2000] | 1157 | 2378 331348 | 79307.2 | 175 890.1 | 1752 685.4 | 596 779.2 
7621 | 15731 | 1206658 | 180089.5 | 263771.8 | 3266 135.9 | 874006.0 


Table 6 Magma 2.24-7 vs. Pari/GP 2.12.1 (Intel Xeon W-2155 3.3 GHz); timings for newspaces 
S of level N > 1, weight k > 1, Nk2 < 2000 by #8 := #{f € S} 


Split time (s) Total time (s) 

#S | Max dim(f) | Num S | Num f | >> dim(S) Magma Pari/GP Magma Pari/GP 
1) [1, 200] 2859} 2859} 161375 423.8 529.9 11 967.2 818.0 

1} (201, 2000] | 1027) 1027; 544092); 26060.6| 55272.8| 701094.2| 53727.6 

1 | (2001, co] 65 65} 215016) 146044.1 | 170 751.3 | 2226371.4 | 163 789.9 
2 | [1, 200] 1703} 3406; 100080 278.7 660.9 4233.8 | 30837.0 
2 | (201, 2000] 145 290 95704} 4192.1 8188.7 | 188 745.7 | 576 764.6 
2 | (2001, co] 4 8 10870} 2636.5} 26821.1 97 329.8 | 24785.1 
>3/[1, 20] 873) 4785 19 282 46.2 64.1 1596.2 1197.9 
>3 | [21, 200] 235| 1155 23 135 160.8 275.7 1228.8 5255.3 
>3 | (201, co] 3 15 1024 12.0 347.7 1364.5 357.4 
7621} 15731} 1206658 | 180089.5 | 263 771.8 | 3 266 135.9 | 874 006.0 


computationally much more difficult for one of the two methods: on the largest 
irreducible spaces in our test range Pari/GP is typically at least ten times as fast as 
Magma, but for newspaces that split into two large Galois orbits Magma is faster 
than Pari/GP by a similar (or even larger) factor. This suggests that the optimal 
approach is to use the explicit trace formula and modular symbol methods in 
combination. Indeed, a hybrid approach that uses Magma to decompose the space, 
and then delegates the computation to Pari/GP whenever the newspace contains a 
Galois orbit of dimension at least 2/3 the dimension of the newspace, takes a total of 
264 726 seconds; this is more than 3 times faster than using Pari/GP alone and more 
than 10 times faster than using Magma alone. 
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Table 7 Some computationally challenging newspaces 


Newspace (Q(x) : Q] Decomposition Magma(s) Pari/GP(s) 
413.2. 28 420 + 420 68.91 23 711.71 
419.2.g 180 6120 79 654.08 5 175.04 
424.2.v 24 1248 60 111.92 150.62 
431.2.g 168 5880 82 907.51 5 333.59 
435.2.bf 12 240 + 240 39.63 25 272.26 
443.2.g 192 6912 180 453.61 8 134.21 
454.2.c 112 1008 + 1120 1197.47 44 216.52 
467.2.c 232, 8816 370 791.77 22 719.24 
472.211 28 56 + 1568 103 117.42 562.40 
478.2. 96 960 + 960 861.98 87 147.90 
479.2.c 238 9282 363 002.59 26 148.67 
486.2. 54 702 + 756 351.60 139 762.27 
487.2.k 162 6480 110903.14 6 766.85 
489.2.q 54 702 + 756 202.91 38 345.59 
491.2.k 168 6720 12 1405.39 8558.33 
497.2. 24 408 + 456 99.20 18 438.91 
498.2.f 40 560 + 560 269.01 48 844.21 
499.2.g 164 6724 119 807.20 12 148.53 


7.2 A Trace Formula Implementation with Complex 
Coefficients 


In this section, we describe an implementation of the trace formula using ball 
arithmetic over the complex numbers due to Bober [8]. This implementation follows 
the description given in Sect.5.2. The main focus here is to compute a moderate 
number of coefficients for all of the newforms in a given space S/°“(N, x) as 
approximate complex numbers, which enables the computation of modular form L- 
functions at small height, for example. These computations use Arb [51], a C library 
which implements arbitrary precision ball arithmetic, so that we can ensure that all 
of our computations come with rigorous error bounds. There is also some facility 
for computing with coefficients in a finite field Fz, where £ is some prime congruent 
to 1 modulo the order of x, which is used in the computation of characteristic 
polynomials of Hecke operators, for example, and in other auxiliary pieces. The 
package also contains some limited functionality to compute information about 
spaces of weight one modular forms, which we do not discuss here. 

We describe briefly some details of how this implementation works in practice. 

To start a computation we first choose a prime and determine a set of trace forms 
which will give a full rank basis of the space of newforms modulo this prime, avoid- 
ing any issues of computing the rank of a matrix with floating point entries. Specif- 
ically, we find some matrix of coefficients (Tr(T (m;)T (nj) | Se“ (N, X)))1 <i, j<a 
that has full rank, and we also choose our m; and nj; so that gcd(mjn;, N) = 1, 
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which will make later computations easier. Once we know which computation will 
give us a full rank matrix, we do the computation again over the complex numbers, 
computing additional coefficients so that we will be able to compute the action of 
Hecke operators. At this point we find a sum of Hecke operators T = >, cnTn 
such that the characteristic polynomial of T is squarefree (keeping c, = O when 
gcd(n, N) # 1). 

The diagonalization of T would in general be a computation over the complex 
numbers, but because we have chosen to only use Hecke operators coprime to the 
level, we can use knowledge of the arguments of the eigenvalues to turn this into 
a problem of diagonalizing a real symmetric matrix. This problem is solved by an 
implementation of Jacobi’s algorithm, certifying the result as described in Sect. 6.2. 
Once we have diagonalized, we obtain a change of basis matrix that takes our 
trace form basis to the newform basis, and we can compute as many coefficients 
of newforms as we like by evaluation of the trace formula. 

Once we have computed all the embeddings of all of our newforms, we may also 
wish to compute the decomposition of the space into Hecke-irreducible subspaces. 
To do this we will compute the characteristic polynomial of a linear combination T 
of Hecke operators (it is sufficient to find one that is squarefree). If we have enough 
precision in the Hecke eigenvalues we have computed, we can do this simply by 
forming the product [[, (x—A), where A ranges over the eigenvalues of T. In general 
we will find that we do not have enough accuracy to uniquely identify a polynomial 
with coefficients in Z[x], however, and we refine the computation by computing this 
polynomial modulo ¢@ for enough small primes ¢ to obtain the polynomial exactly. 

The factorization of this Hecke polynomial gives the decomposition of 
S LN ,X) into Hecke irreducible subspaces. However, there is still one more 
problem we may be faced with: namely, identifying which embeddings correspond 
to which subspaces. In this problem we have a set of polynomials fi, f2,..., f, and 
a set of approximations of complex numbers 7), 72,...,7¢, and all we need to do 
is determine which complex number is a root of which polynomial. This may seem 
like a relatively trivial problem, but in fact these polynomials may be enormous and 
obtaining enough precision in the roots to solve this by simple evaluation may not 
be feasible. 


Example 7.2.1 To give a moderately-sized example, we can consider the space 
766.2.c. This space is only 32-dimensional over the field of definition of x, but there 
are 190 Galois conjugate characters to consider, so the full degree is 6080. The 
characteristic polynomial of 73 acting on the space S}°” (766, [x]) is squarefree and 
factors into 2 irreducible factors of degree 3040; each factor contains more than 1.5 
million decimal digits. 


To make this problem tractable, we again make use of the arguments of the 
eigenvalues. Let f(x) be one of these irreducible factors. We know that each root of 
f can be written as ¢t for some root of unity ¢ and some real number f, and we find 
that ¢ is a root of the greatest common divisor of f(¢x) and fx) in Q(¢)[x]. In 
fact, as we prefer to work with real numbers, we compute 
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gcd(f (ex) + f(¢-'x), if (ex) — fE'x) € QE +07) Ex]. 


In principle, computing this greatest common divisor when we have only floating 
point approximations available could be troublesome, but it is possible because we 
know what its degree is. 


8 Issues: Computational, Theoretical, and Practical 


8.1 Analytic Conductor 


Earlier efforts to tabulate modular forms have tended to compute all newforms in 
particular boxes, where the weight and level each vary in a specified range. This 
approach is easy to describe, but the computational complexity of finding newforms 
with simultaneously large weight and level ensures that some newforms of interest 
will be missed (either large weight or large level). Instead of working with boxes, 
we organized our computation around a single invariant which scales with the 
complexity of the newform. 

Introduced by Iwaniec—Sarnak [50, Eq. (31)] (see also [waniec—Kowalski [49, 
(5.7)]), the analytic conductor of a newform f € Se (N , X) is the positive real 
number 


k/2))\? 
A:=N (eee) ; (8.1.1) 
20 
where w(x) := I'’(x)/T'(x) is the logarithmic derivative of the Gamma function. 


The analytic conductor includes a factor that can be thought of as measuring the 


complexity at infinity. We have A ~ as k — ov, so for simplicity we 


1672 
organized our computations by specifying bounds on the quantity Nk?. 


8.2 Sturm Bound 


In this section, we elaborate upon bounds for truncations of qg-expansions of 
modular forms that determine them uniquely. The most well-known of these bounds 
is due to Hecke (and more generally to Sturm [91, Theorem 1]). 


Theorem 8.2.1 (Hecke, Sturm) Let [ < SL2(Z) be a congruence subgroup and 
let f be a modular form of weight k for. Then f = 0 if and only if an(f) = 0 for 
all0 <n < k[SL2(Z) : P]/12. 
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In fact, for modular forms with character as in our setting, one can apply a sharper 
bound (as though it was without character) as follows. 


Definition 8.2.2 Fork, N € Zs1, define the (Hecke-)Sturm bound 


k Nk 1 
Sturm(k, N) := 75[SLo(Z) : Poy] = 55 (1 r ~). 
p|N 


Proposition 8.2.3 (Hecke, Sturm) Let N,k > 1 and let x be a character of 
modulus N. Let T C Endc(S;(N, x)) be the Z-subalgebra generated by the Hecke 
operators T, for alln € Zs , and let Z[x] C C be the Z-subalgebra generated by 
the values of x. Then there is a natural inclusion Z[x|] — T, and the following 
statements hold. 


(a) If f € Se(N, x) has an(f) = 0 for alln < Sturm(k, N), then f = 0. 
(b) T is generated as a Z[ x |-module by T,, for alln < Sturm(k, N). 
(c) 7 is generated as a Z| x |-algebra by T, and T, for all primes p < Sturm(k, N). 


Proof For the inclusion Z[x] <> 7, we argue as follows: from the Hecke recursion 


T,2 —T, + x(p)p*' =0 (8.2.4) 
for p { N, we see that x (p) p*—! € T; choosing two distinct primes p congruent 
modulo N and applying the Chinese Remainder Theorem shows that Z[x] C 7. 
Consequently, 7 is a torsion free Z[x ]-module. Since Z[x] is a Dedekind domain, 
T is locally free. 

Abbreviate S := S,(N, x; Z[x]). We claim that the pairing 


TxS>2Z[x] 
(8.2.5) 
(T, fp a (Tf) 
is perfect, i.e., the map 
g: S > Homzy\(7, Z[x]) 
(8.2.6) 


fr Tr a(Tf)) 


is an isomorphism. When Z[x] = Z, this is an argument of Ribet [77, Theorem 
(2.2)], and we only need to make a small modification. The map ¢ is injective with 
torsion-free cokernel because a; o T, = da, and the map taking a form to its q- 
expansion is injective. Since S and 7 are locally free Z[x ]-modules of finite rank, 
it suffices to show that the rank of 7 is at most the rank of S (localizing at primes [ 
of Z[x ]). To this end, consider the other map induced by the pairing, namely, 
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w: T — Homz,y\(S, Z[x]) 
Tre (fh a(Tf)). 


(8.2.7) 


We claim that w is injective. Indeed, if T € kerw, then for all f € S andalln > 1 
we have 


0=a@(T)(Thf) = 411 Tn f) = a1(InTf) = an(TPf) 


as JT is commutative. Since the g-expansion map is injective, we conclude Tf = 0 
for all f, so T = 0 as an endomorphism, proving the claim. Finally, localizing w at 
(, the injectivity of w implies the desired rank bound. 

We now prove (a) following Buzzard, and suppose that f € S;,(N, x; Z[x]) has 
an(f) = 0 for all n < Sturm(k, N). Let d be the order of x, let s := Sturm(k, N), 
and consider f4 € Syx(Mo(N); Z[x]). Since f = O(g*t!), we have f4 = 
O (q@6t)), Moreover, Sturm(kd, N) = ds, so by the Sturm bound (Theorem 8.2.1) 
applied to Saz(1'9(NV); ZLx ]) we conclude ti = 0, which implies f = 0. 

To prove (b), let Tz, CG T be the Z[x]-submodule generated by T,, withn < 
Sturm(k, NV). By the previous paragraph, the pairing (8.2.5) restricted to T=, is still 
perfect; indeed we can simply argue with 7, in the injectivity of w in (8.2.7). We 
conclude that Tz, = T. 

For part (c), we use multiplicativity to see that 7,, for n composite is contained 
in the algebra generated by the prime power operators 7,,-, and then the Hecke 
recurrence and induction to see that 7), is contained in the algebra generated by T; 
and T),. 


8.3 Atkin—Lehner Operators and Eigenvalues 


Let x be a Dirichlet character modulo N. For M | N with gcd(M@, N/M) = 1, 
there are unique characters x (mod M) and xn/;m (mod N/M) such that x = 
xm Xn/m- The Atkin—Lehner—Li operator Wy [2, $1] maps the space S;(N, x) to 
Sk(N, XuxXn/m)- In general X¥uxXn/m is different from x, so then this operator 
cannot be used for splitting up spaces. We have X¥yxn/m = x when the character 
Xm is trivial or quadratic, and in these cases, Wy is an involution on the space 
Si(N, x), which then splits as the direct sum of +1l-eigenspaces. Magma only 
implements Atkin—Lehner operators on spaces with trivial character, where they 
commute with all Hecke operators and hence map every newform f to +f, and 
the sign + is the Atkin—Lehner eigenvalue of f with respect to M. (By a common 
abuse of notation and terminology, when M is the power of a prime g not dividing 
N/M, the operator Wy is often denoted W,.) In our computations we only compute 
Atkin—Lehner eigenvalues on newforms with trivial character. 

In general, the image of a normalised newform f in Sx(N, XuxXn/m) under 
Ww is a multiple of a normalised newform in S,(N, XuxXwn/m), and the multiple, 
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not necessarily +1, is called the pseudo-eigenvalue of Wy on f. Atkin—Li [2] do 
not give a general formula for pseudo-eigenvalues, which are not always easy to 
compute in practice. See also Belabas—Cohen [4, §$5-6]. 

When M = | the operator Wy is trivial, while when M = N it is called the 
Fricke involution. The Fricke involution is the product of all W, for primes g | 
N (using the convention of the previous paragraph.) For a newform f with trivial 
character, its Fricke eigenvalue is minus the sign of the functional equation of its L- 
function, and each W,-eigenvalue is the sign of a certain local functional equation. 


8.4 Self-Duality 


The coefficient field of a newform f € S;(N, x) is either totally real or CM [75, 
Prop 3.2]; we say that f is self-dual in the totally real case. Computing the coefficient 
field can be time consuming, so we use the following easier criteria when applicable. 


Proposition 8.4.1 (Ribet) Let f € Sy(N, x) have Hecke orbit of dimension d and 
trace form Y~~ 9 tng". Then the following statements hold. 


(a) If x is trivial or d is odd, then f is self-dual. 
(b) If x has order larger than 2, then f is not self-dual. 
(c) If there exists a prime p so that ty # 0 and x(p) # |, then f is not self-dual. 


Proof See Ribet [75, Propositions 3.2 and 3.3]. 


8.5 Efficiently Recognizing Irreducibility 


Level N = 2 is by far the most time-consuming case for Magma (note that this 
allows for the largest range of k with N ¥ 1 for any given bound on the analytic 
conductor). For k > 400 with 4 | k, each space takes more than 12 hours of CPU 
time. However, we observed behavior analogous to the Maeda conjecture in weight 
1 up to weight k < 400, with the additional prediction that the Atkin—Lehner 
operator splits the space as evenly as possible. 


Conjecture 8.5.1 For all k > 2, the space Sx(1o(2)) decomposes under the Atkin— 
Lehner operator W2 into Hecke irreducible subspaces of dimensions |d/2\ and 
[d/2] where d := dimc Sx(To(2)). 


For general N, the difference dim S.(N)* — dim S2(N)~ can be expressed in 
terms of class numbers of imaginary quadratic fields (fixed points of Atkin—Lehner 
involutions), and so this difference is < N!/?+€. Perhaps this is explicit enough for 
N = 2 to prove the dimensions above (but not irreducibility), and something similar 
can be said for N composite. 
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In practice we should be able to prove that the space is irreducible by working 
with just one Hecke polynomial as these are often irreducible. 


Question 8.5.2 Given ann x n matrix with entries in Z[¢,,] (typically sparse), is 
there a fast algorithm that with positive probability correctly determines when its 
characteristic polynomial is irreducible? 


In other words, if you expect that a polynomial is irreducible, can you verify this 
quickly without factoring the polynomial? Under the expectation that the Galois 
group of the polynomial is transitive and therefore likely to be Sg, one could succeed 
in some cases by factoring the polynomial modulo primes. This is different than the 
typical factorization problems solved in computer algebra systems, which compute 
a factorization p-adically and then reconstruct the factorization over Z. (See Table 7 
for some difficult spaces where this would help.) A natural generalization of this 
question would be to efficiently determine the degrees of the irreducible factors of 
its characteristic polynomial (without explicitly computing it). 


8.6 Trace Form 


As defined in Sect. 4.5, each newform f ¢€ S7/°“(N,[x]) has an associated trace 
form Tr(f)(q) = >>, mq" € Sk(N, [x]; Z) equal to the sum of the distinct 
Gal(Q* | Q) conjugates of f and thereby well-defined on its Galois orbit [ f]. More 
precisely, we have Tr(f) € S;““(I") where 


T= {(42) ero): x(@) = 1 DTN), 


(but in general Tr(f) ¢ S,(N, x)). One can thus apply the Sturm bound (The- 
orem 8.2.1) for T°: trace forms of newforms in S;(N, x) with the same Fourier 
coefficients a, for n < k[SL2(Z) : T]/12 must coincide, but note that this will 
typically be larger than the Sturm bound Sturm(k, NV). As noted above, we always 
have t; = [K : Q], where K = Q(f) is the coefficient field of f. 

Trace forms can be efficiently computed using the trace formula. In the common 
case where S;,(N, x) is irreducible, this can be done via the Pari/GP function 
mf£traceform [72], which is dramatically faster than computing the coefficients 
of f as elements of K and taking traces (indeed, it is not even necessary to 
determine K). More generally, if one knows the decomposition of S;(N, [x ]) into 
newform subspaces and has computed trace forms for all but one of them, the 
remaining trace form can be computed by subtracting corresponding coefficients 
of the known trace forms from the coefficients given by mftraceform, which 
computes absolute traces of the Hecke operators 7;, acting on the entire newspace 
Sc (N, [x]). Alternatively, one can sum complex coefficients a, of the Galois 
conjugates of f computed to sufficient precision to allow the sum to be identified as 
a unique integer. 
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The coefficients t, of the trace form at primes p are equal to the Dirichlet 
coefficients of the (typically imprimitive) L-function L(s) = )¢ b,n~* with integer 
Dirichlet coefficients b, obtained by taking the product of the L-functions of the 
Galois conjugates of f. But for nonprime values of n the integer coefficients f, 
do not match the integer coefficients b, unless the newspace has dimension one (in 
which case L(s) is primitive). Indeed t; = [K : Q] cannot coincide with b; = 1, and 
in general the coefficients ¢, at nonprime values of n encode different information. 


8.7 Presenting Coefficients Using LLL-Reduction 


One of the most dramatic improvements we saw, both in performance and in display, 
is in the choice of how to represent coefficient rings. In this section and the next, we 
explain two such methods. 

As explained in Sect. 4.5, one computes a minimal polynomial for the coefficient 
field by factoring the characteristic polynomial of a Hecke operator. This polynomial 
may be unwieldy! So we first apply the Pari/GP function polredbest which 
finds an improved minimal polynomial representing the same field by computing an 
LLL-reduced basis for an order with respect to the Minkowski embedding (whose 
underlying quadratic form is given by the 7>-norm)—this runs in deterministic 
polynomial time in the size of the input. When possible, we improve this to 
polredabs, which applies the same technique but to the maximal order (this may 
require factoring a discriminant, and we frequently encounter situations where this 
is a bottleneck). 


Remark 8.7.1 The function polredabs changed in Pari/GP 2.9.5 (Fall 2017); we 
use the more recent version, described in [28]. 


We can make significant further improvements by optimizing the Z-basis we use 
to represent coefficients. Let f € S/°“(N, x) be a newform. By the Hecke—-Sturm 
bound (Proposition 8.2.3), the coefficient ring of f is generated over Z[ x] by the 
values a,(f) for n < Sturm(k, VN), so by extension we obtain a set of Z-module 
generators for the ring. We reduce this to an LLL-reduced Z-basis for the coefficient 
ring, and we rewrite the coefficients in terms of this basis. In our computations, we 
always use complex precision that is at least as large as the discriminant of the 
coefficient ring. 

We observe the following. 


Lemma 8.7.2 Let F be a number field and let R C F be a Z-order in F.. Then the 
shortest vectors in R with respect to the T-norm are exactly the roots of unity in R. 


Proof The order R contains 1, and 73(1) = n := [F : Q]. More generally, for any 
root of unity ¢ € R, we have 72(¢) = n. Conversely, let a € R satisfy Tr(a) < n. 
Then by the arithmetic-geometric mean inequality, we have 
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T: 1 
is uo) sen >> ey? > I la; [2/" = INmpig@ 2/n 


with equality if and only if jaj| = --- = |a,| = 1. Buta e€ R is integral, so 
INmF\Q | > 1, so equality holds. By Kronecker’s theorem, we conclude that a is 
a root of unity. 


In spite of this lemma, because of nonuniqueness, we may not have | as an 
element of an LLL-reduced basis as there may be more roots of unity than the 
degree, such as in a cyclotomic field. However, using the above lemma we can 
recognize the roots of unity in the coefficient ring and thereby recognize when the 
coefficient ring is cyclotomic itself, where we may institute a canonical basis (see 
also the next section). 

The effect of such a representation is dramatic. 


Example 8.7.3 Consider the newform 153.2.e.c. Its coefficient field is Q(v) where 
v has minimal polynomial 


x29 — 19 4 3418 17 4 13516 — 19415 4 Say! 4 27x13 4 93x12 — 54x!) 


+ 693x!9 — 162x9 + 837x8 + 729x7 + 4374x® — 2916x> + 9477x4 + 4374x3 
+ 19683x7 — 19683x + 59049. 


An integral basis written in terms of the powers of v is too large to record here, and 
similarly the coefficients of f written in a power basis are enormous! 


However, in terms of an LLL-reduced basis Bp = 1,--- , 619, we have 
coefficients 
a2(f) = Bio 
a3(f) = —Bio 


a4(f) = —1— B3 — Bs 


as7(f) = Bo — 63 — Ba + Bs — 387+ 2Bo — Bio 
+ 2613 + Bia + Bis — Bio + 2817 — Bis — Bio 


that are very small integer linear combinations of the basic elements. Moreover, we 
have 
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1= Bo 
v= Bi 
v* = Bg — By — Ba + Bo 


v!? = 5114B19 + 2632B1g + --- — 18071 — 6756 


and the powers of v are reasonably sized Z-linear combinations of our basis 
elements. 


We observe that the matrix that writes the powers of a primitive element in terms 
of the LLL-reduced basis is noticeably smaller than the other way around. Working 
with the coefficient ring itself rather than a maximal order containing it is not only 
more efficient (as it may be prohibitively expensive to compute such a maximal 
order), but it also seems to give better results. 

The intuitive reason that this works is simple: by the Ramanujan—Petersson 
bounds, the coefficients of a newform are of small size in all complex embeddings, 
and so it can be expected that writing it in terms of a Z-basis which is LLL-reduced 
with respect to size provides a small linear combination. 


Remark 8.7.4 In the above, we have been concentrating on the case where f is 
a newform, representing a Galois orbit of newforms, and we write down its q- 
expansion in terms of a Z-basis for the coefficient ring. 

As an alternative, we can consider the C-vector space spanned by f and its 
conjugates under Aut(C), making a C-vector space of dimension say d. These 
conjugates will include conjugates that do not preserve the character, so we would 
either be working implicitly in the direct sum of the spaces over the full Galois orbit 
of characters, or we need to restrict to quadratic characters, or we only consider 
conjugates under Aut(C | Q(x)) and get a Q(x)-vector space. However, inside this 
space is a canonical Q-subspace, namely, those forms whose g-expansion belongs to 
Ql[[qg]]. So we could instead represent the Galois orbit canonically by an echelonized 
basis of d individual g-expansions with coefficients in Q. We could then write 
a representative newform as before as a linear combination of this basis over the 
coefficient field. 

To go from the eigenform to the Q-basis, we apply the operators Tr(6; f) for 
f; any Q-basis for the coefficient field. (To go from the Q-basis to an eigenform 
one needs to retain sufficiently many eigenvalues to do the linear algebra. In other 
words, the eigenform contains more information than the Q-basis.) This generalizes 
the trace form, which is where we take 6; = Bo = 1. 

We could also work integrally and take the Z-module of forms whose gq- 
expansions belong to Z and then take a LLL-reduced basis which minimizes a 
(weighted) sum of finitely many coefficients. It is conceivable that in a world where 
linear algebra over Q is much faster than linear algebra over number fields that 
we could succeed in computing a Q-basis in reasonable time but not succeed in 
computing an eigenform. 
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8.8 Presenting Coefficients Using a Sparse Cyclotomic 
Representation 


When the coefficient ring of a newform is contained in a cyclotomic field Q(ém), 
writing coefficients in terms of an LLL-optimized basis as described in the previous 
section does not necessarily give the most compact representation, for two reasons. 
First, when the coefficient ring is not the maximal order, it may be more compact 
to express coefficients as elements of the maximal order Z[¢,,]. Second, even when 
the coefficient ring is the maximal order, in which case the LLL-optimized basis 
will typically be the standard power basis 1, Gn, on ae Ame the eigenvalues 
a, can often be written more compactly by expressing them as sparse polynomials 
in ¢, rather than integer linear combinations of the power basis. Every integer linear 
combination of elements of the power basis is of course also a polynomial in ¢),; 
the question is whether to allow polynomials of higher degree whose terms involve 
powers of ¢,, that are not in the power basis (because m > }(m) = [Q(ém) : Q)), 
which allows more flexibility and a potentially sparser choice of polynomial. 

This added flexibility is particular relevant for weight one newforms, whose 
coefficients always lie in a cyclotomic field. The correspondence between weight 
one newforms and (odd irreducible) two-dimensional Artin representations [36] 
implies that for weight one newforms the eigenvalues a, can always be written as a 
sum of at most two roots of unity. For composite values of n the eigenvalues a, will 
not be as sparse, but even if one naively expands them as products of polynomials 
in ¢m for each apr, for most a, we obtain an expression with O(2!°8!°8") nonzero 
coefficients (a typical integer n has loglogn distinct prime factors p and p-adic 
valuation | at all but O(1) of them), which is exponentially sparser than a generic 
element of Z[¢,,] written in the power basis. For even values of m we can improve 
on this naive approach by using the identity ra * — —1 to reduce to polynomials 
of degree at most m/2 — 1 in ¢,,; this never increases the number of terms and may 
reduce it. 

For example, the second Fourier coefficient of the newform 3997.1.cz.a is 


_ _,570_, +244 
a2 = —Cigr + 6570» 


143 
eee 


but when written in terms of the standard power basis 1, ¢570, 579 We instead 


have 


2 5 u 12 15 17 19 _ 20 | ,21 24, 4.27 , ,30 _ ,31 
a2 = 1+ b379 + $570 + $570 — $570 — S570 — $570 + $570 — $570 + $570 + $570 + $570 + $570 — $370 


32 4.34 , -35 _ 36 39 42 45 | 46 dT | 49 50, SI 59 
+ $579 — $570 + $570 — $570 — 570 — $570 — $570 + $570 — $570 + $570 — $370 + $570 — $570 


60 _ ,61 64 | .65 _ 66 , »74 _ 475 _ ,78 79 _ 80 , 88 89 , ,90 

+ $570 — $570 — $570 + $570 — $570 + $570 — $570 — $570 + $570 — $570 + $570 — $570 + $570 
93 94.97 100-103 , ,104 105 106 , ,107__ -108 , ,109 , ,112 , ,115 

+ $579 — $570 — $570 — $570 — $570 + $570 — $570 — $570 + $570 — S570 + $570 + $570 + $570 
18 119 , ,120_ 122, 4123 124 127 «130, 4 134, 4137 | 139, 4142 

+ $579 — $579 + $570 — $570 + $570 — $570 — $570 — $570 + $570 + $570 + $570 + $570- 
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Among the 585 nonzero a, with n < 2000, the average number of terms needed 
to express da, as a sparse polynomial in ¢, is 4.1; by contrast, when written in the 
power basis the average number of nonzero coefficients of a, is 42.8. This leads to 
a more than tenfold reduction in storage and a corresponding reduction in the time 
to transmit or render the g-expansion. 


Remark 8.8.1 For modular forms of weight k > 1 with cyclotomic coefficient 
fields Q(¢m) there is no a priori reason to expect the a, to be expressible as sparse 
polynomials in ¢,,, and one can see in examples that this is often not the case. 
One might instead try to apply a more general approach, which, givena € Z[ tm] 
searches for a sparse polynomial f (fj) of degree less than m with small coefficients 
that is equivalent to a. We do not know an efficient solution to this problem, but 
we note that even if one exists, for generic values of @ it is unlikely to result 
in representations that are significantly more compact than using the power basis 
for purely information theoretic reasons: the number of a € Z[¢,,] that can be 
expressed as r-term polynomials in ¢,, using b bits to represent the coefficients 
must be approximately the same as the number of integer vectors of length ¢(m) 
that can be encoded in b bits. For this reason we use sparse cyclotomic coefficient 
representations only for k = 1. 


8.9 Hecke Kernels 


Having determined the decomposition of a newspace S fase (N, [x ]) into Hecke orbits 
Vy corresponding to newforms f, we can compute and store information that will 
allow us to reconstruct a single Hecke orbit, without having to decompose the entire 
newspace again. This is particularly useful when the dimension of a particular 
newform f of interest is much smaller than that of Se (N, [x]). To achieve this 
we compute a list of pairs (p, gp(X)), where p is a prime and g € Z[X] is 
the minimal polynomial of the Hecke operator T, acting on Vy (viewed as a Q- 
subspace of Se* (N , [x])), such that V+ is equal to the intersection of the kernels 
of the linear operators g,(Tp) acting on Se (N, [x]), in other words, the operators 
&p(T,) generate the Hecke kernel of Vy. Such a list of generators can be used to 
reconstruct the newform f in Magma via the Kernel function. 

It is computationally convenient to restrict to primes p not dividing the level N, 
and to use the same list of primes p for all the newforms in Spo (N , [x]). To this 
end, for a set of primes S, not dividing N, and a newform f, we let X ¢(S) denote 
the set of pairs (p, gp»), where g, € Z[X] is the minimal polynomial of T, acting 
on V;, and say that S is a set of distinguishing primes for the newspace aay (N,[x]) 
if the sets X ¢(S) are distinct as f varies over the newforms of Seo (N , x). 

We construct a set of distinguishing primes as follows. We start by taking S to be 
the empty set. If the newspace S7'°”(N, [x ]) consists of a single Hecke orbit, then S 
is a set of distinguishing primes, and otherwise we increase the size of S by adding 
the least prime p { N not contained in S for which 


Computing Classical Modular Forms 177 


{X¢(S) 2 f © SEY (N, [xXD} S (Xp (SU {pp : f © SEN, LD}. 


We observe that the cardinality of the set S constructed in this fashion is at most 
one less than the number of Hecke orbits in S?°’(N, [x ]). This greedy approach to 
constructing S does not necessarily minimize its cardinality, but it does minimize 
the largest p that appears in S, which may be viewed as an invariant of the newspace. 
For example, we may distinguish the 8 Hecke orbits of the newspace 2608.2.g, where 
2608 = 24-163, using p = 3 and 41. In this case 73 distinguishes all the forms with 
the exception of the two CM forms, which both have vanishing a, for all p split 
in Q(/—163), hence the smallest prime p such that a, could possibly distinguish 
them is 41, and 41 does in fact do so. 


Remark 8.9.1 The largest prime p that appears in S may occasionally exceed the 
Sturm bound, as in the case of the newforms 66.2.6 and 735.2.p, for example. This 
fact is relevant in the context of Theorem 11.2.8, which we use to determine the 
group of inner twists of a newform in Sect. 11. This is one reason to compute ap(f) 
past the Sturm bound. 


9 Computing L-Functions Rigorously 


In this section, we describe rigorous methods to compute L-functions of modular 
forms. 


9.1 Embedded Modular Forms 


To a newform f € S?*’(N, x), with q-expansion }°a,q", for each complex 
embedding of the coefficient field « : Q(f) <> C we may consider the embedded 
modular form 


i= yeaa" (9.1.1) 


which is a modular form over the complex numbers. 

We label such forms by N.k.s.x.c.j, where N.k.s.x is the label of Hecke orbit, 
N.c is the Conrey label for the character corresponding to the embedding, and j 
is the index for the embedding within those with the same Dirichlet character; these 
embeddings are ordered by the vector t(a,), where we order the complex numbers 
first by their real part and then by their imaginary part. 

To such an embedded modular form :(f), we may associate a primitive L- 
function of degree 2 
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L(f).8) = Yo tan = | [Lp Wf). p) 
P 


-1 
=[] (1-«@,)p-) "TT (1 — U(dp)p* + x(p)p-*) 
PIN P{N 
(9.1.2) 
Let A((f), 5) = N/T c(s)L((f), 5), where Mc(s) := 2(27)~*T(s). Then 
A(t(f), £5) continues to an entire function of order 1 and satisfies the functional 
equation 


AQ(f), s) = eAQ(f), k— 5), (9.1.3) 


where ¢ is the root number of A(i(f), 5), a root of unity. 

The generalized Riemann hypothesis also predicts that any nontrivial zero of the 
L-function lies on the line of symmetry of its functional equation Re(s) = k/2, 
known as the critical line. To study the behavior of L(i(f), s) on the critical line, it 
is natural to introduce the associated Z-function, a smooth real-valued function of a 
real variable ¢ defined by 


k/2+it 
z/2 y (k/2 + it) 


IDE SE era ag 


LCF), k/2.+ it), (9.1.4) 


where y(s) := N s/ Tc (s) and the square root is chosen so that Z(t) > 0 for suffi- 
ciently small t > 0. By construction, we have |Z(i(f), t)| = |L@(f), k/2+it)|, the 
multiset of zeros of Z(/(f), t) matches the multiset of zeros of L((f), k/2 + it), 
and Z(t(f), t) changes sign at the zeros of L(u(f), k/2 + it) of odd multiplicity. 


9.2 Computations 


Given 1(f) we would like to compute certain invariants of L(u(f), s). For example, 
the root number ¢, the imaginary part of the first few zeros on the critical line, an 
upper bound on the order of vanishing at s = k/2, the leading Taylor coefficient at 
Ss = k/2, and the plot Z(i(f), t) on some interval. Given that a majority of these 
items cannot be represented exactly, we instead aim to determine a small interval in 
R or rectangle in C. Precisely, let b denote the number of bits of target accuracy. We 
would like to compute the following: 


¢ the root number: x,, ye € Z such that gb+l Re(z) € [xe — 1, x, + 1] and 
2>+1 Im(z) € Ly - 1, y + 1]; 


e the imaginary part of the first few zeros on the critical line: t,...,f, € Z such 
that U,[t — 1,4 + ip! covers the first n zeros of L(u(f), k/2 + it); 
¢ an upper bound on the order of vanishing at s = k/2: r := max;{i 


[LO W(f), k/2)/i!| < 2-2-}}; 
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e the leading Taylor coefficient at s = k/2: 0 #4 s e€ Z such that 
Pt LMa(f), k/2)/r! € [s +1, 5 — 1]; 

* an approximation to the plot of Z(u(f),¢): approximations as doubles of 
Z(t(f), id) for some chosen 6 andi = 0,...,n. 


In order to rigorously compute the items above, we follow an approach that builds 
on several improvements and extensions of the algorithm from [10] specialized to 
the motivic case, the details of which will appear in future work. In practice, given 
the first C, iw N embedded Dirichlet coefficients, with sufficient precision, and while 
carrying out all floating-point calculations using rigorous error bounds and interval 
arithmetic [51], one may compute all the items above to the desired bit accuracy. 
A generic library to carry out such computations, due originally to Dave Platt [27], 
has been developed. 


Example 9.2.1 For an explicit example, we encourage the reader to peruse the 
source file examples/cmf_23.1.b.a.cpp in [27], where the authors show 
how to use the library to compute all of the items above for the modular form 
23.1.b.a, which matches its unique embedding. By running this example, one can 
compute that 


e=(1+£107''7) + (044.7 x 10>”) i, 


(since f is self dual, we must actually have e€ = 1), and 


L(f, 1/2) = 0.174036326987934183499504592018 + 8.2317 x 10-°”, 


as well as approximate values for the imaginary part of the first ten zeros. Using the 
notation above, we can represent an approximation to the imaginary part of the first 
Zero 


5.11568332881511759855335642038 + 3.9443 x 1073! 


by the interval [t; — 1,4 + 1]2710!, where 
ty = 129697980847000609 145 17716069360. 


The imaginary part of the following nine zeros are approximately 7.15926, 8.88140, 
10.2820, 11.4300, 12.9344, 14.6625, 16.4982, 17.1013, and 18.0807. 

We carried out this computation with 100 bits of target accuracy for the 
14 398 359 embedded newforms in our database with k < 200. In our computation 
we observed that it was sufficient to work with 200 bits of precision and Cy < 
0.08k log(k) + 24. While we did not keep track of CPU time used along the way, by 
rerunning some of the computations, we extrapolate that we spent at least 11 CPU 
years on these computations. 
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9.3. Imprimitive L-Function 


Associated to a newform f with coefficient field Q(f) of degree d, we may also 
consider the L-function of degree 2d associated to its Galois orbit: 


LGEos= [| L0G). =[|tG¢r™. (9.3.1) 
rQ(fyeC Pp 


This gives rise to a Q-primitive L-function with L,(f,T) € 1+ TZ[T], which 
satisfies the functional equation 


A(f, 8) = NST e(s)4L(f, s) = eACf, k— 5), (9.3.2) 


where now we have ¢ = +1. Using the invariants for each L((f), 5) mentioned 
above, one can easily deduce the respective invariants for L(f, 5). 

For these L-functions we would also like to compute the local factors for small p. 
This is straightforward if one has access to an exact representation of a, in Q(f). 
Otherwise, we relied on Newton identities to compute L,(f, 7) € Z[T] from the 
roots of Ly((f),T) € C[T], while working with interval arithmetic [51]: see 
L(500.2.e.c) for an example. In some cases, for example when [Q(f) : QJ] or the 
weight is large, we were only able to compute the initial coefficients for some local 
factors—this occurred for L(20.10.e.b), for example. 


9.4 Verifying the Analytic Rank 


In this section, we discuss methods for rigorously verifying the analytic rank of 
a modular form L-function. Throughout, let N and k be positive integers and let 
f € Sk(P1(N); C) be a newform of weight k and level N (with coefficient field 
embedded in the complex numbers). 


Definition 9.4.1 Suppose k is even. We define the analytic rank of f to be the order 
of vanishing of L(f, s) at k/2. 


When L"(f,k/2) # 0 one can certify such a statement using ball arithmetic 
by working with enough precision. However, if L(f,k/2) = 0, this approach 
does not work, as there is no known bound ¢ such that |L“)(f, k/2)| < ¢ implies 
L™ (f, k/2) = 0. Nonetheless, if the order of vanishing is small, then there are other 
methods to computationally verify the order of vanishing. Using these methods we 
were able to provably verify the analytic rank of all modular forms for which the 
L-functions were computed. The way the analytic computations were verified is 
detailed below. The strategy used depends on the order of vanishing, and whether 
the modular form is self-dual or not. The analytic rank zero case is skipped because 
this can just be done by computing L(f, k/2) to enough precision using interval 
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Table 8 Number of even 


ia . aa Analytic rank 
weight newforms in the 
database by analytic rank = | eel ee 
Self-dual 83338 | 85254 | 2565 | 1 0 
Not self-dual 63 804 1798 1 | 0 0 
Total 147142 | 87052 | 2566 | 1 0 


arithmetic until 0 is no longer in the computed interval. A summary of analytic 
ranks of even weight newforms in the database is shown in Table 8. 


Self-Dual and Analytic Rank I We begin by considering self-dual newforms f 
whose analytic rank numerically appears to be 1. All such forms in the range of 
our computation have trivial character. In this case the functional equation takes the 
form 


A(f,s) =i A(f,k—5), (9.4.2) 


where e’ = +1 is the eigenvalue of the Atkin-Lehner involution Wy. For such 
forms in the database, we verified that e/“* = —1, forcing A(f,k/2) = 0, and the 
non-vanishing of N°/*I'c(k/2) then implies that L(f,k/2) = 0. The upper bound 
of 1 on the analytic rank was obtained using interval arithmetic. 


Non-self-dual and Analytic Rank I Following Stein [89, $8.5], we define a pairing 
between modular forms and modular symbols 


Se(Ti(N)) ® Se(T1(N)) x ModSym, (C1 (N)) > C 


by defining 


b b 
(Cf, g), P{a, b}) = / f(z) P(z, Ddz +f g(z)P(Z, 1)dz. 


a 


This pairing allows one to determine the vanishing of L-functions, because for 
every integer | < j < k — 1 we have 


Lf) = ((f,0), XY U-Y 0, 00). 


The pairing is Hecke-equivariant, meaning that ((T, f, Trg), x) = (Cf, g), Tnx) for 
all integers n. 

Let f € S(T \(N)) be a newform and Ve C S7?°“(\(N)) the subspace 
generated by its Galois conjugates. Then by Atkin—Lehner-Li theory, V+ is a simple 
module over the Hecke algebra 7, and there exists a Hecke operator tr € 7 
such that te: My(Ti(N)) — Mx(1i(N)) is a projection onto Vr. Because ty 
is a projection we have ((f,0),x) = ((@¢¢f,tf0),x) = ((f,0),trx) for all 
x € ModSym, (I; (4V)), and hence in particular this means that if 
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tp(XI-ty*®2- GD, oo}) = 0 


then L(f, = = 0. 

A map ty with the same kernel as tf can be obtained from t¢(ModSym, (I (N), 
Q)) in ieeme using the command PeriodMapping. Furthermore, this Magma 
command only uses exact arithmetic over Q. For all non-self-dual modular forms 
whose analytic rank numerically seemed to be 1, it was verified that indeed 
t' (X*/2-lyk/2-140, oo}) = 0, implying that L(f, k/2) = 0. The upper bound of 1 
a the analytic rank was again obtained using interval arithmetic. 


Self-Dual and Analytic Rank 2 As in the preceding subsection, all newforms in 
the database whose analytic rank numerically seemed to be 2 have trivial character. 
This time it was verified that e’:* = 1 for all these modular forms. In particular, the 
functional equation then forces all odd derivatives of A(f, 5) to vanish at k/2. This 
forces the order of vanishing of A(f, s) at k/2 to be even, and hence the analytic 
rank of L(f, 5) to be even as well. The techniques of the preceding paragraph were 
used to prove that for all these modular forms one has that L(f,k/2) = 0, which 
together with the parity argument gives a lower bound of 2 on the analytic rank. The 
upper bound of 2 was again obtained using interval arithmetic. 


Non-self-dual Analytic Rank 2 There is exactly one Galois orbit of non-self-dual 
newforms in the database whose analytic rank numerically seems to be 2. Let f 
denote the newform of weight 2 and level 1154 with LMFDB label 1154.2.e.a with 
coefficient field Q(¢3). This pair corresponds to an isogeny class of abelian surfaces, 
and our first goal is to find a representative of this isogeny class. By searching for 
hyperelliptic curves over F, that match the local factors of L(f,s) for small p, 
and then by lifting their Weierstrass equations to Z we found the following genus 2 
curve: 


C: y? = x® — 12x° + 34x4 — 18x3 — 11x? + 6x +1. (9.4.3) 


Letting J denote its Jacobian, we find it is of conductor 1154? as desired. Our goal 
is first to show that J really is in the isogeny class of abelian surfaces corresponding 
to the newform 1154.2.e.a. Using [26] we were able to compute the endomorphism 
ring of J, and verify that J is of GL2-type and hence is modular [78, 57]. Thus, J 
is a good candidate to be a representative of the isogeny class of abelian surfaces 
corresponding to the newform 1154.2.e.a. Alternatively, one can also verify that J is 
of GL2-type by noting that C has an automorphism of order 3 given by x + 1—1/x, 
y t+ —y/x? and thus showing that its Jacobian is of GL2-type. Additionally, the 
Euler factor at 5 of its L-function is 


12-67 £0? 230r°? 4 sr" 


which is irreducible. Hence its Jacobian is simple, showing that its Jacobian 
corresponds to a pair of Galois conjugate newforms of level 1154. There is one 


Computing Classical Modular Forms 183 


other pair of Galois conjugate newforms whose coefficient field is Q(¢3), namely 
that with LMFDB label 1154.2.c.a. So it remains to show that J does not come from 
the newform with label 1154.2.c.a. However, the Euler factor of the L-function at 
5 for that newform is 1 — 37 + 47? — 157? + 25T* which does not match that 
of J. This means that Jacobian of C really is in the isogeny class of abelian surfaces 
corresponding to the newform 1154.2.e.a. 

Using the Magma function RankBounds one readily computes that J has 
Mordell-Weil rank 4. In particular, it has rank 2 as a module over Z[¢3]. The 
generalization by Kato of the work of Kolyvagin and Logachev on the Birch— 
Swinnerton-Dyer conjecture in the analytic rank 0 and 1 cases to all isogeny factors 
of J; (V) (see Kato [53, Corollary 14.3]) shows that the order of vanishing of L(f, s) 
at | cannot be 0 or | since this would give J rank 0 or | as a Z[¢3]-module. So the 
order of vanishing is at least 2. An upper bound was again obtained using interval 
arithmetic. 


Self-Dual Analytic Rank 3 The approach here is similar to that in the preceding 
paragraph and the result was already briefly mentioned in [31, Section 3.4] where 
the analytic rank is determined for all elliptic curves of conductor N < 130000, 
later extended to all elliptic curves of conductor less than 500000 and rank at 
most 3. There is only one newform that numerically seems to be of analytic rank 
3 in the database, namely 5077.2.a.a of weight 2, level 5077 and trivial character. 
This modular form corresponds to the elliptic curve y* + y = x? — 7x + 6 which has 
rank 3 and is the only one in its isogeny class. The verification that its L-function has 
analytic rank 3 is a famous calculation of Buhler—Gross—Zagier [15], used by Gross— 
Zagier [45] in their solution to the Gauss class number | problem. We confirm it 
quickly as follows: by known cases of the Birch-Swinnerton-Dyer conjecture, the 
analytic rank cannot be 0 or 1; by parity of the root number, the analytic rank cannot 
be 2, so it must be at least 3; and an upper bound on the analytic rank of 3 is obtained 
by interval arithmetic. 


9.5 Chowla’s Conjecture 


The definition of analytic rank (Definition 9.4.1) as an order of vanishing also makes 
sense for k odd, and by analogy one might also find it natural to study the central 
values of L(f, s) at k/2 and their derivatives. However, for k odd the central value 
s = k/2 is not a special value in the sense of Deligne [37] and thus there is no 
abelian group whose rank (as a module over an appropriate coefficient ring) is 
conjecturally related to its leading Taylor coefficient. It would therefore be a stretch 
to call the order of vanishing at the central point an analytic rank. Moreover, one 
does not expect L(f, k/2) to ever vanish, and this is a generalization of Chowla’s 
conjecture for Dirichlet L-functions [21], as follows. 

Let x be a non-trivial Dirichlet character, then the functional equation associated 
to L(x, s) := )° x(n)n™, similar to Eq. 9.1.3, relates L(x, 5) to L(X, 1 — 5). The 
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value of L(x, 1) is quite well understood. For example, the fact that L(1, x) 4 0 
gives us Dirichlet’s theorem on arithmetic progressions, and for primitive real char- 
acters the value L(x, 1) gives us Dirichlet’s class number formula. As mentioned 
above, inspired by Definition 9.4.1, one might also find it natural to study the order 
of vanishing of L(y, 5) at s = 1/2 and its derivatives. However, it is believed 
that L(x, 1/2) # 0; this was first conjectured by Chowla [21] for primitive real 
characters and later generalized to other characters. One of the reasons behind such 
a belief is that for primitive real characters the root number of such L-functions 
is always | [44], and thus there is no simple reason for L(x, 1/2) to vanish. 
Although Chowla’s conjecture remains open, it has been numerically verified for all 
real characters x of modulus less than 10!° [70], and substantial progress towards 
showing the non-vanishing of L(x, 1/2) has also been made, see [44] for a short 
overview. 

A generalization of Chowla’s conjecture is that L(f,k/2) 4 0 for k odd. As 
in the case of Dirichlet L-functions for primitive real characters, we also have that 
the root number of L(f, k/2) can never be —1 when f is self-dual. This is in stark 
contrast to the case of self dual even weight modular forms, where the root numbers 
are split approximately 50-50 between — 1 and 1. We verified this generalization of 
Chowla’s conjecture, as we computed L(f, k/2) for every newform in our database 
with k < 200, and found that this was nonzero for all the odd weight newforms. 


10 An Overview of the Computation 


In this section, we provide an overview of the computations we performed, the 
results of which are now available in the LMFDB [60]. These were accomplished 
using a combination of Magma, Pari/GP, and SageMath scripts, as well as hand 
written C code for some of the more computationally-intensive tasks. In aggregate, 
these computations consumed more than 100 years of CPU time. 


10.1 Data Extent 


Our database consists of four overlapping sets of newforms described in Table 9. 
These datasets were chosen both for reasons of mathematical interest, and to ensure 
that the database included all modular forms contained in existing datasets such 
as the Stein tables of modular forms [88], the Buzzard-Lauder tables of weight 
one newforms [19], and the previous database of modular forms contained in the 
LMFDB. More detailed statistics on the newforms in the database can be found at 
the statistics page. 
For the first dataset (1), we used three independent sources of newform data: 
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Table 9 Extent of the newform database (only nonzero newspaces are included) 


Constraints on S?°“(N, x) Newspaces | Newforms | Embeddings 
(1) Nk? < 4000 30738 67180 | 9966498 
(2) Nk? < 40000, |x| = 1 16277 170611 3.092 301 
(3) Nk? < 40000, k > 1, dim SP°’(N, x) < 100 30345 131540 1648 617 


(4) Nk? < 40000, N < 24 or 

Nk? < 100000, N < 10 or 

N <100,k < 12 7627 12237 676574 
Union of sets above 62 142 281219 14398 359 


¢ Complex eigenvalue data for each embedded newform of weight k > 1 computed 
by the m£1ib software package [8], which uses Arb [51] to rigorously implement 
the trace formula (as described in [85], for example) to obtain approximate 
complex values to a precision of 200 decimal digits. 

¢ Exact algebraic eigenvalue data for each newform of weight k > | and dimension 
d < 20 computed using Magma’s [11] modular symbols package (originally 
written by William Stein). 

¢ Exact algebraic eigenvalue data for each newform of weight k > | and dimension 
d < 20 were computed using the modular forms implementation in Pari/GP [72] 
described in [4], which was also applied to all newforms of weight k = 1. 


For k > 1 and Nk? < 4000 the decomposition of every newspace SEY (N, xX) 
was computed in all three cases and compared for consistency. Exact algebraic data 
was computed only for newforms of dimension d < 20, except for k = 1 where 
exact algebraic data was computed in every case. For newforms of weight k > 1 
and dimension d < 20, the algebraic data independently computed by Magma and 
Pari/GP were checked for consistency (this was not a completely trivial task, as 
it generally required determining an appropriate automorphism of the coefficient 
field in order to compare sequences of Fourier coefficients). We also compared the 
trace forms using all three methods and compared the results for consistency, and 
for newforms of weight k = 1 and level N < 1500 we compared the Pari/GP 
computations with the Buzzard-Lauder database [19]. 

Datasets (2) and (3) were computed entirely in Magma, as was dataset (4), except 
for 12 spaces of high dimension where complex analytic methods were used. For 
the portions of these datasets that overlap with the Stein database of modular forms 
[88], we compared the results for consistency. 

For newforms f = >> ang" of level N < 1000 we computed 1000 coefficients 
an, While for newforms of level 1001 < N < 4000 we computed 2000 coefficients, 
and for newforms of level 4001 < N < 10,000 we computed 3000 coefficients. 
This substantially exceeds the Sturm bound in every case, and also exceeds the 
bound 30./N required for the L-function calculations described in Sect. 9. For every 
newform in the database we computed complex coefficients to a precision of at least 
200 bits. In cases where we compute algebraic coefficient data we computed an 
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optimized representation using an LLL-basis as described in Sect. 8.7, along with a 
set of generators for the coefficient ring. 

For each newform we determined any non-trivial self-twists admitted by the new- 
form (CM, RM, or both), and for newforms with algebraic eigenvalue data available, 
we computed all inner twists as described in Sect. 11. We also computed the analytic 
rank of every newform, as described in Sect. 9.4, and for weight one newforms we 
computed the image of the associated projective Artin representation and a defining 
polynomial for its kernel, as described in Sect. 12. These computations have now all 
been rigorously verified. 

In addition to the newform database, we computed dimension tables for all 
newspaces in the range Nk* < 40,000 with k > 1, and we computed trace forms for 
all newspaces of level N < 4000 in this range using the mft raceform function 
in Pari/GP. 


10.2 Statistics 


In addition to the ability to browse and to search for examples with specific 
properties, the modular forms database allows for an investigation of arithmetic 
statistics. The LMFDB [60] includes precomputed tables displaying how various 
quantities vary across the database, some of which we have duplicated here in 
Tables 10, 11, 12, and 13. 

In addition to these static tables, we have added dynamic statistics 
https://www.lmfdb.org/ModularForm/GL2/Q/holomorphic/dynamic_stats 
which allow users to customize which variables to view and any constraints to 
impose. For example, a researcher might create a table displaying how the weight 
and level vary among forms with complex multiplication. We hope that this new 
feature will enable examination of large-scale patterns, both in the modular form 

data and elsewhere in the LMFDB. 


Remark 10.2.1 The statistics and examples presented in this article reflect the 
dataset defined in Sect. 10.1, which represents the state of the LMFDB as of January 
2020. As new data is added to the LMFDB these statistics may no longer match 
those displayed in the LMFDB, and the number of newforms returned by some of 
the example queries listed below may increase. 


Table 10 Distribution of 


: Analytic rank 0 1 2 3 
analytic ranks : 


Count 191520| 87052) 2566 a 
Proportion 68.12% | 30.96% | 0.91% 0.00% 
Example 23.1.b.a | 37.2.a.a | 389.2.a.a | 5077.2.a.a 
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Table 11 Distribution of projective images 


Projective image A4 S4 As D2 Dn 
Count 458 1033 202 1311 17613 
Proportion 2.37% 5.35% 1.05% 6.79% 91.23% 
Example 124.1.i.a 148.1.f.a 1763.1.p.b 3600.1.¢e.a 3997.1.cz.a 


Table 12 Distribution of inner twists 


Inner twists 


FS 99S |) 129 197 


Count 47 492 


Proportion 26.31% | 45.94% | 16.89% 1.53% | 0.00% | 0.02% 
Inner twists 16 24 44 56 
Count 311 14 | 2 


Proportion 0.11% 


Table 13 Distribution of self twist types by weight 


Weight 

Neither 174853 vy 27877 40 278 255 818 
8.77% 98.27% 87.85% 98.02% 93.91% 90.97% 

CM only 15 841 3074 1538 563 2ola 23 629 
82.05% 1.73% 12.15% 1.98% 6.09% 8.40% 

RM only 461 461 
2.39% 0.16% 

Both 1311 1311 
6.79% 0.47% 


10.3 Data Reliability 


All of our modular form data was computed or verified using rigorous algorithms 
that do not depend on any unproved assumptions or conjectures. 


¢ Self-twists were either verified via Theorem 11.2.4 and Proposition 11.1.7 using 
exact algebraic Fourier coefficients a, or ruled out using complex approxima- 
tions of sufficient precision to rigorously distinguish zero and nonzero values 
of a, and checking for self-twists by all primitive quadratic characters yw of 
conductor dividing the level (a newform that admits a self-twist by y must have 
an = O whenever w(a,) 4 1). 

¢« We computed and verified inner twists for all newforms in our dataset that are 
either of weight one or have dimension at most 20 by computing sufficiently 
many algebraic Fourier coefficients and applying Theorem 11.2.4 and Proposi- 
tion 11.1.7. 
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Analytic ranks were computed using complex approximations as described in 
Sect.9 and then rigorously verified using the symbolic methods described in 
Sect. 9.4. 

For weight one newforms the classification of projective images as Dy, Aa, 
S4, As was rigorously verified by explicitly computing the number field fixed 
by the kernel of the associated projective Galois representation. As described 
in Sect. 12, this was accomplished using a combination of the ray class field 
functionality provided by Pari/GP and Magma, the rigorous tabulation of all 
A4, S4, and As number fields with compatible ramification, and the explicit 
computation of quotients of ring class fields of orders in imaginary quadratic 
fields via the theory of complex multiplication. 


In addition to using mathematically rigorous algorithms, we performed a variety 


of consistency checks intended to catch any errors in the software packages used to 
compute modular forms data, or any errors that might have been introduced during 
post-processing. The following checks have been performed: 


All newforms of weight k > 1 and level N satisfying Nk? < 2000 have been 
independently computed using Magma and Pari/GP. By comparing the results of 
these computations we have verified that the decompositions of each newspace 
s (N, x) into Galois orbits agree (with matching coefficient fields), that the 
first 1000 coefficients of the trace forms for each Galois orbit agree, and for 
newforms of dimension d < 20, that there is an automorphism of the coefficient 
field that relates the sequences of algebraic eigenvalues (a1, ... , @1000) Computed 
by Pari/GP and Magma. 

For all newforms of weight k > 1 and level N satisfying Nk* < 4000 we have 
verified that the trace forms computed by Magma (using modular symbols) agree 
with the trace forms obtained from complex analytic data computed using the 
explicit trace formula. This also verifies the dimensions of the coefficient fields. 
For newforms of weight k = 1 and level N < 1000 we have matched the data 
computed using Pari/GP with the tables computed by Buzzard and Lauder [19]. 
For all dihedral newforms of weight k = | and level N < 4000 we have matched 
trace forms with data computed using the explicit trace formula in Pari/GP with 
data independently computed using the ray class field functionality implemented 
in Pari/GP and Magma. 


As a consistency check for our L-function computations, after computing a 


provisional list of all non-trivial zeros on the critical line up to a chosen height 
bound b we confirmed that no zeros are missing, in other words, that the Riemann 
Hypothesis holds for each L-function up to height b. We use the method described 
in [16] based on the Weil—Barner explicit formula. If an L-function also arises from 
another object in the LMFDB for which we already had computed its L-function we 
verified that these computations match. 


Computing Classical Modular Forms 189 


10.4 Interesting, Extreme Behavior and Examples from 
the Literature 


When putting modular forms in a database it is easy to view them as an aggregate, 
but of course each modular form is distinct and many have unique interesting 
properties. 

We take this opportunity to recall the rich history and special properties of several 
forms in this database. We also provide links between these forms and the literature 
and note several forms that have naturally arisen in previous work. We focus on 
weight k > 2 in this section; see Sect. 12.5 for interesting behavior in weight 
k=1. 


¢ The most well known, and the prototypical, example of a modular form is the 
Ramanujan A function, of weight 12 and level 1; its label is 1.12.a.a. This is 
the lowest weight in which a cusp form appears for the full modular group, so 
many properties of more general newforms were first noticed for A. Similarly, 
A has served as a testing ground for techniques and results before they were 
known more generally. For instance, the Ramanujan—Petersson conjecture was 
first made by Ramanujan for A but later extended to all newforms. Additionally, 
computation of the g-expansion coefficients of A, traditionally denoted by t(n) 
and known as Ramanujan’s t function, is the subject of the monograph [39]. 

e By the modularity theorem, newforms of weight 2 with rational coefficients 
correspond to isogeny classes of elliptic curves over Q. The smallest level in 
which a weight 2 form appears is 11, corresponding to the smallest conductor of 
an elliptic curve over Q. Here we necessarily have trivial character and the label 
is 11.2.a.a; this form has g-expansion 


q| fa-dya-q'"y. 


k>1 


¢ The weight 2 newforms with CM by fields with the largest absolute discriminants 
in the database are 2169.2.d.a with CM by Q(./—723), 8388.2.e.c and 2097.2.d.a 
with CM by Q(./—699), 2061.2.c.c with CM by Q(./—687), and 7524.2.|.b with 
CM by Q(./—627)—the last of these has 8 inner twists. 

¢ The weight 2 newform 867.2.i.a with CM by Q(./—51) has 32 inner twists, and 
the weight 1 newform 3481.1.d.a with CM by Q(./—59) has 56 inner twists. 

* The weight 3 newform 7.3.b.a has CM by Q(./—7), making it the first (by 
analytic conductor) newform of weight > 3 with CM. 

¢ Watkins [99, §9.1.3] discusses several examples of modular forms of analytic 
rank 2. The query https://www.lmfdb.org/ModularForm/GL2/Q/holomorphic/? 
weight=4-&analytic_rank=2- returns 130 forms of weight at least 4 and analytic 
rank at least 2, many of which are mentioned by Watkins, including 2 of weight 8. 

e Watkins also discusses modular forms of weight 2 with which are non-self- 
dual yet have positive analytic rank, particularly examples with quadratic 
character, such as 122.2.b.a. The query https://www.lmfdb.org/ModularForm/ 
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GL2/Q/holomorphic/?weight=2 &char_order=2&is_self_dual=no&analytic_ 
rank=1- produces 567 such examples. In larger weight we have 8.14.b.a which is 
non-self-dual and has analytic rank 1, as does 162.12.c.i. 

¢ The index of the coefficient ring in the ring of integers of the coefficient field can 
get quite large, as in the case of the newform 8.21.d.b where the index is at least 
2!53 .. 315 . 54. 7°. In weight 2, the largest index we computed was 27° - 3+ for 
2016.2.k.b and 4032.2.k.h. 

e« Many newforms in our database have very large Hecke orbits. For example, the 
newform 983.2.c.a has relative dimension 81 over its character field Q(f491) and 
Q-dimension 39 690. 


10.5 Pictures 


For every newform f, every nonempty newspace S$?“ (N, x), and S?° (I; (V)) for 
which we have all the newforms, we have created a portrait based on their trace 
forms, a total of 641 562 portraits. The picture is generated by plotting the absolute 
value of the trace form in the Poincaré disk, obtained as the image of (1 —iz)/(z—i) 
in H, where the color hue represents the absolute value modulo | (with blue being 
zero, and increasing through purple, red, orange, yellow, ...). For example, as the 
trace form is always zero at oo, the top center is always blue, see Fig. 1. 

We deviated from the normal approach, used by complex_plot in SageMath, 
of representing magnitude by brightness (with zero being black and infinity 
being white) and the argument by hue, as this often leads to an overexposed or 
underexposed picture, see Fig. 2. 

Given the number of portraits needed, we limited ourselves to the first 100 
Dirichlet coefficients of the trace form, working with 200 bits of precision, 
evaluating it in a 300 x 300 grid in [—1, 1]?, and storing the picture as a 184 x 184 
PNG. Overall this consumed about 100 CPU days, and their disk footprint is 45 GB. 
For aesthetic reasons, the portraits presented here were computed to a higher quality, 
which creates some discrepancies with the online version, especially in higher 
weight newforms. 

Even though we opted for a plot with less information, it still captures some 
mathematically interesting features. For example, the behavior on the edge of the 
disk is a good indicator for level and weight, see Figs. 3 and 4. 

The size of the blue spot on top center is inversely correlated with the growth of 
the trace form away from oo, thus for fixed weight this is a good indicator for the 
dimension, see Fig. 5: their dimensions are 1, 4, 33, and 120, respectively. 

Finally, one could also be tempted to infer the self-twists of a newform by 
comparing it with other forms in S/°“(I|(N)), see Fig. 6 for ST°” (IT) (164)). 
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Fig. 1 Portrait of 23.1.b.a 


Fig. 2. Portraits of 11.2.a.a and 1.12.a.a and their plots using complex plot in SageMath 


Fig. 3. The portraits for 11.2.a.a, 100.2.a.a, 1001.2.a.a, and 9996.2.a.a 
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Fig. 4 The portraits for 7.3.b.a, 7.9.b.a, 7.27.b.a, and 7.81.b.a 


Fig. 5 The portraits for 9359.2.a.a, 9359.2.a.e, 9359.2.a.k, and 9359.2.a.r 


Fig. 6 The portraits for 164.1.d.a, 164.1.d.b, 164.1.j.a, and 164.1.1.a 


Remark 10.5.1 These portraits differ from those now used in the LMFDB. 
Between writing and publishing this article we chose to instead use the pure 
phase portraits describe in §2.2.5 of [61]. 


10.6 Features 


In parallel to carrying out the computations described elsewhere in this paper, we 
rewrote the user interface to the database. We highlight some of the more prominent 
new features in this section, some of which are being extended to other sections of 
the LMFDB. 

The search interface includes multiple modes for viewing results. After entering 
constraints such as weight, level and dimension, there are four different search 
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buttons available. In addition to the standard list of results, a user can choose to 
go straight to a randomly chosen newform. Alternatively, there are dimension tables 
available which display the dimension of the spaces of newforms as a function of 
weight and level. Finally, a table of traces allows for searching on specific Fourier 
coefficients, including specifying a particular class modulo an arbitrary integer. This 
feature can be used to find modular forms matching geometric objects via point- 
counting. 

All of these search modes are also available for newspaces. For newspaces, the 
list mode shows the dimensions of the corresponding newforms as well as the 
Atkin—Lehner dimensions in the case of trivial character. For both newforms and 
newspaces, users can customize the order of the search results. 

The homepage for an individual newform has also been completely restructured. 
Newforms can be downloaded and reconstructed in Magma, allowing for further 
computations if desired. We include complex eigenvalues for embedded modular 
forms even when exact Fourier coefficients are not feasible to compute. 

One of the key motivations for our extensive computations of (exact or inexact) 
Fourier coefficients of newforms is to allow their L-functions to be computed. 
In addition to providing additional mathematical information about the newform, 
such as its analytic rank and special values, this allows us to automatically connect 
newforms to other objects in the LMFDB. Examples include: 


¢ The L-function L(256.2.a.e) lists both the Bianchi modular form 2.0.4.1-4096.1-b 
and the Hilbert modular form 2.2.8.1-1024.1-m as origins (both arise as base 
changes of 256.2.a.e), as well as the corresponding elliptic curve isogeny classes 
2.0.4.1-4096.1-b over Q(i) and 2.2.8.1-1024.1-m over Q(V/2). 

e The L-function L(72.2.d.a) has (at least) three additional origins: the Hilbert 
modular form 2.2.8.1-81.1-b, the elliptic curve isogeny class 2.2.8.1-81.1-b, and 
the isogeny class 5184.a of the Jacobian of the genus 2 curve 5184.a.46656.1. 

¢ The L-function L(1948.1.b.a) also arises as the L-function of (the Galois orbit 
of) the icosahedral Artin representation 2.1948.24T576.1. The L-functions home 
page also lists the four conjugate Artin representations (and four embedded 
weight one newforms) whose L-functions are primitive factors of this imprimi- 
tive L-function of degree 8. 


11 Twisting 


In this section, we discuss twists of modular forms and related computational issues. 
For background and further reading, we refer the reader to the foundational articles 
by Ribet [75, 76]. 
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11.1 Definitions 


We begin with definitions, followed by some examples. Throughout this section, 
let f € S{°%(N, x) be a newform of weight k € Zs, level N € Z>1, and 
character x, and let Ke := Q({an(f)}n) © C be its coefficient field. Let y be 
a Dirichlet character of conductor cond(y), and let yw be the primitive Dirichlet 
character inducing y (with cond(wo) = cond(y)). Then there is a unique newform 
g := f ® w characterized by the property that 


an(g) = Wo()an(f) for all n coprime to N cond(w); (11.1.1) 
we call g the twist of f by w. However, more is true: in fact, we have 
an(g) = Wo()an(f) for all n coprime to cond(y) (11.1.2) 


including those n that are not necessarily coprime to N cond(y): see Atkin—Li 
[2, Theorem 3.2]. By the recurrence satisfied by the Hecke operators, (11.1.2) is 
equivalent to the condition 


ap(g) = W(p)ap(f) for all p { cond(y). (11.1.3) 


The newform g has character x (by (11.1.8) below) and level dividing 
Icm(N, cond(w) cond(x w)) (by Lemma 11.2.1 below). We call the newform g 
the twist of f by y and say that g is a twist of f. 

As above, the group Aut(C) acts on the set of newforms in Se (N ,X), with 
an(o(f)) = o(an(f)) for alln > 1. We have o(f) € S/°Y(N,o(x)), where 
o(x)(n) = o(x(n)) for alln > 1. If g = f @y, theno(g) = o(f) ®a(W) for all 
o € Aut(C). Accordingly, the set 


[fleWl:={few:felflw ew) (11.1.4) 


has an action of Aut(C) and so consists of finitely many Aut(C)-orbits (possibly 
more than one). Accordingly, we say that [g] is a twist of [f] by [y] if there exist 
f' € [fl Ww’ € (Wl, ge’ € [g] such that g’ = f’ ® W’, or equivalently, [g] C 
[f1@ [wl. 


Example 11.1.5 The newform orbits 3380.1.v.e and 3380.1.v.g are both twists of 
3380.1.g.c by 13.f (and by 260.bc). 


With this Galois digression out of the way, we return to the treatment of twists of 
(embedded) newforms. 


Definition 11.1.6 Let wy be a Dirichlet character and o: Kf <= C be a field 
embedding. We say that f admits an inner twist by the pair (W, 0) if f@ wv =oa(f). 
In the special case that o = id|x,, we say that f admits a self-twist by y. 
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Let InnTw(f) denote the set of inner twists of f and Selffw(f) C InnTw(/) the 
subset of self-twists. Then projection onto the first component identifies SelfTw(f) 
with a subgroup of Dirichlet characters. By (11.1.2), the form f has an inner twist 
by (W,o) if and only if o(a,) = W(n)a, for almost all n. The twist is said to 
be inner because such twists stay “within” the Galois orbit of f (a nontrivial inner 
twist is sometimes also referred to as an “extra twist”). Every newform has a trivial 
self-twist by (1.a, id IK): 


Proposition 11.1.7 (Ribet [76], Momose [68]) The following statements hold. 
(a) If(W,o) € InnTw(f), then 


a(x) = x’: 


so if w € SelfTw(f) then w is quadratic. 
(b) If(W,o) € InnTw(f) theno € Aut(K f). 
(c) InnTw(f) naturally forms a group under 


(ho): (WW, 0") = (Wo(p’), a0’). 
(d) There is an exact sequence of groups 


1 > SelfTw(f) — InnTw(f) ay Aut(K f) 
(Wa)Pho. 


Let A := m(InnTw(f)). Then InnTw(f) ~ SelfTw(f) x A is a direct product. 

(e) The projection (W,o) t w from InnTw(f) to the set of Dirichlet characters is 
an injective map of sets. 

(f) The group A is abelian. 

(g) Suppose SelfTw(f) is trivial. Then x is an isomorphism and the assignment 
o> Wo ifand only if (Wo, 0) € InnTw(f) is a well-defined 1-cocycle, i.e., 


Woo! = WoO (Wo). 


Proof These results originate with Ribet [76, §3] and Momose [68, Lemma (1.5)], 
but they work under the hypothesis that f has no self-twists. For clarity, we repeat 
these arguments to show this hypothesis is unnecessary. Let f(q) = ©, nq”. 

Part (a) follows by looking at (Nebentypus) characters using the Hecke recur- 
rence (or the determinant of the associated Galois representations). Explicitly, on 
the one hand, the character of o(f) is o(x); on the other, if € is the character of 
Ff ® w then for all good primes p the Hecke recurrence reads 


e(p)p'| =ap(f @ YW) —apl(f @ vy 


7 ; ~~ - : cs (11.1.8) 
= Wp) (ap (SY — 4,2(f)) = WP)" x (Pp) P 
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so € = xw*. Consequently, a self-twist by w gives xy = x, so W? is the trivial 
character. 

For part (b), by (a) we have Ww? = 0(x)x, and we claim w takes values in 
Q(x): indeed, if x(1) = ¢ is a primitive dth root of unity, then checking cases 
based on the parity of d reveals that 0 (¢)/¢ € (¢7). Since Q(w) C K ¢, we conclude 
oO (an) = W(n)dy € K ¢ for almost all n, so o(K ¢) C Kf as desired. 

For part (c), we start with o’(a,) = W’(n)a, and apply o to get 


—1 


(00")(Gn) = 0 (')(n)o (Gn) = oY) (0) (n)an 


for almost all n, so (Wa(w’), oo’) € InnTw(f). This product is associative: the 
identity element in InnTw(/f) is (1.a, id|x ¢)s and inverses are given by (vy, o) t= 
(o!(),o7'). 

In part (d), the exact sequence is evident from (c). The group InnTw(f) visibly 
has the structure of a semidirect product InnTw(f) ~ Selffw(f) x A via A > 
Aut(SelfTw(f)) by o & (WwW bh a(w)). However, by (b) SelfTw(f) consists only 
of quadratic characters, so o(W) = w for all o so the product is direct. 

Part (e) follows from the fact that w uniquely determines o. 

Part (f) is claimed by Ribet [76, Proposition (3.3)]: we prove it as follows. As in 
(a), let x(n) = € ando(x)(n) = cf Then again w(n) = gene (for some choice 
of square root of ¢). Write similarly o’(x)(n) = ce Then 


a Ch) ch D2 tase 
ie = ¢&-De-D/2 


v ck-1/2 


is well-defined, and by symmetry this is equal to (0 (w)/w)(n), giving Wo(w’) = 
w’ o'(w), and similarly o’(x)(n) = c* This calculation shows the projection of the 
products (7, o)(W’, 0") = (Wa(w’), a0") and (W’, o')(W, 0) = (W'o'(), oo") 
agree. By part (e), it follows that oo’ = o’o and A is abelian. 

Finally, part (g) is immediate from (c). 


Example 11.1.9 Consider the (embedded) newform 180.1.m.a.107.2; it represents 
the unique newform orbit in the space 180.1.m of weight 1 and level 180 with 
character orbit 180.m, whose g-expansion begins 


f@=4-60 -@at+ a —%q' +0"), 


where ¢g = exp(27i/8) = (1 + i)/</2 is the primitive eighth root of unity in the 
upper quadrant and K ¢ = Q(ég). 

The group Selffw(/) of self-twists is of order 2 with nontrivial character 4.3, 
the quadratic character of conductor 4 associated to the field Q(./—1). The group 
of inner twists has order #InnTw(f) = 8, and we compute InnTw(f) ~ (Z /2Z)°, 
generated by the elements 

(4.3, id), (8.2, fg > —g), (5.3, fg > Sg). 
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The character w5 with label 5.3 has order 4, so letting 03 € Aut(Q(ég)) by 
03(fg) = . we have 


(Ws, 03)? = (Ws 03(Ws), 03) = (WsW5 |, id) = 1. 


The projection of InnTw(f) onto the set of characters yields characters with 
conductors 1, 3, 4, 5, 12, 15, 20, 60. 


Example 11.1.10 For f with label 361.2.e.d and Kf = Q(é1g), we have no 
nontrivial self-twists and 7: InnTw(f) — Aut(K yf) is an isomorphism onto its 
image. In fact, we compute that z is surjective, so InnTw(f) ~ Z/6Z. More 
precisely, the elements of order 3 in InnTw(f) correspond to the characters 19.7 
and 19.11 of order 3, and in the character orbit 19.e there are three characters whose 
elements match with automorphisms of order 2 and two of order 6. 


Example 11.1.11 Among the forms of weight k = 2, trivial character, and 
dimension 2, we can search for forms with inner twist, and we should see a table 
that matches Cremona [29, Table 3] up to level N < 300. The lists match with one 
exception: we found one form 169.2.a.a that was missed by Cremona. 


Newforms of weight k > 2 that admit nontrivial self-twists are commonly said 
to have complex multiplication, for reasons we now explain. 


Proposition 11.1.12 (Ribet) The following statements hold. 


(a) Ifk => 2 and f has nontrivial self-twist by w, then is associated to an 
imaginary quadratic field and is unique, i.e., SelfTw(f) ~ Z/2Z. 

(b) Ifk = 1, then f has nontrivial self-twist by w if and only if f has dihedral 
projective image. If so, then w may be real or imaginary and SelfTw(f) is a 
subgroup of (Z/2Z)°. 


Proof For part (a), see Ribet [75, Theorem (4.5)], a consequence of the theory of 
complex multiplication. 

For part (b), we recall Sect. 12 and observe that f has self-twist by wy if and only 
if ap(f) = 0 for all p inert in Q(w) and by classification this happens if and only if 
the image of the projective Galois representation is dihedral. In this case, let L be the 
fixed field of the kernel of the projective Galois representation associated to f, so 
Gal(L | Q) ~ Dy, the dihedral group of order 2n. Then for each quadratic subfield 
F C L, the form f has self-twist by the character associated to F'. Accordingly, 
when n > 2 the subfield F and associated self-twist character are unique, and when 
n = 2 (so K is biquadratic) there are three distinct subfields and corresponding 
characters and there is a real quadratic subfield. 


Example 12.5.1 shows that forms in Proposition 11.1.12(b) indeed occur. In light 
of Proposition 11.1.12, we make the following definition. 
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Definition 11.1.13 We say f has real multiplication (RM) if f has self-twist by a 
character attached to a real quadratic field and complex multiplication (CM) if f has 
self-twist by a character attached to an imaginary quadratic field. 


Remark 11.1.14 It is common in the literature to just replace the term self-twist 
by complex multiplication. By Proposition 11.1.12(a), there is no harm in this for 
weight k > 2, but for weight k = 1 we think this is potentially confusing, and we 
want to avoid saying “f has complex multiplication by Q(/5).” 


Example 11.1.15 As in the proof of Proposition 11.1.12(b), weight 1 forms can 
have RM or CM or both. Forms with RM correspond precisely to ray class 
characters of real quadratic fields that are of mixed signature (i.e., even at one real 
place and odd at another). 


Example 11.1.16 CM modular forms may also have an inner twist that is not a self- 
twist: the smallest example by analytic conductor is 52.1.j.a, having CM by Q(./— 1) 
and two inner twists that are not self-twists. This phenomenon is not restricted to 
weight 1, for example the same is true of the form with label 20.2.e.a. 


Continuing with the theme of working with newforms that have not yet been 
embedded, we conclude this section by showing that the inner twist group is well- 
defined on the Galois orbit. 


Lemma 11.1.17 For all t € Aut(C), we have an isomorphism of groups 


InnTw(f) > InnTw(t(f)) 


(11.1.18) 
(Wi ao) (ty, tot” !). 


Proof From o (an) = a, (n) for almost all n we conclude 
(tot !)(t(an)) = Tan) (tTW)(n) 


for almost all n, and conversely. 


11.2. Detecting Inner Twists 


With definitions out of the way, we now drill down to precisely understand the 
level of twists. We keep notation from the previous section, in particular f(q) = 
yen an(f Yq" € SR (N, x) is anewform and yw is a Dirichlet character of conductor 


cond(w). 


Lemma 11.2.1 Let M be the level of f ®, so f @W € S*(M, xw7). Then the 
following statements hold: 


(a) For all primes p, we have the inequality 
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ord,(M) < max(ord, (N), ord, (cond(y) cond(x v))), 


with equality if ord,(N) 4 ordp(cond(w) cond(x w)). In particular, M divides 
Iem(N, cond(y) cond(x y)). 
(b) For all primes p we have 


ord, (cond(y)) < ord, (cond(w) cond(x w)) < max(ord,(N), ord,(M)). 


In particular, cond(w) cond(x w) divides lcm(M, N), and if M divides N, 
then cond(y) cond(x vv) divides N. 


Proof Statement (a) can be found in Booker—Lee—Strombergsson [9, Lemma 1.4]: 
this improves the upper bound of Shimura [86, Proposition 3.64] and Atkin—Li [2, 
Proposition 3.1] that 


M | Icm(N, cond(y)’, cond(x) cond(y)), (11.2.2) 


which can be proven directly. 
For statement (b), we prove the contrapositive. Let p | cond(y) cond(x y) and 
suppose that ord, (cond(y) cond(x y)) > ord,(N). Then by (b) we have 


ord,(M) = ord,(cond(y) cond(x w)) > ordp(N). 


Lemma 11.2.3 Jf ap(f) 4 O for some prime number p, then ordp(N) € 
{1, ord, cond(x )}. 


Proof If ordy(N) = 0, then ord,(cond(x)) = 0. If ord, cond(x) ¥ ordp(N), ie., 
x is a character modulo N/p, then ap(f) 4 0 implies ordp(V) = 1 by a result of 
Li [59, Theorem 3]. 


We recall by Proposition 11.1.7(b) that if (W,o) e€ InnTw(f), then o ¢€ 
Aut(K ¢). But since we do not need this in the proof, we state the following theorem 
more generally. 


Theorem 11.2.4 Let f(q) = >0,, an(f)q" € SP (N, x), and leto € Gal(K f |Q) 
where Kr © Cis the Galois closure of K ¢. Let w be a primitive Dirichlet character, 
and let w' be the primitive character that induces xw. Then f ® Ww = a(f) if and 
only if all of the following conditions hold: 


(i) cond(y) cond(y’) | N; 

(ii) x? = 0(x); and 
(iii) o(@p(f)) € {ap(f)W(D), ap (fw (p)} for all primes p < Sturm(k, N). 
Proof Let f € St°*(N, X) denote the dual of f, with coefficients a,(f) = an(f). 
Thus f = f @xX (ef. Atkin-Li [2, Proposition 1.5] or Ribet [75, $1, p. 21]) and 
consequently f @w = f @w’ as 
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an fw! (1) = an( AX (XW)M(n) = an( PW) 


whenever gcd(n, N) = 1. 

First we prove (=), and suppose that f ® yY = o(f). By Proposition 11.1.7 
we have x? = a(x). Since cond(o(f)) = cond(f) = WN, we have 
cond(y) cond(y’) | N by Lemma 11.2.1(c). Let D := gced(cond(y), cond(w’)). 
Then 


cond(x)=cond(y'y) | lem(cond(w), cond(w’))=(cond(y) cond(y’)/D) | (N/D). 
(11.2.5) 

Let p be prime. If p { cond(y) then a(ap(f)) = ap(f ® w) = ap(f)w(p). 
Similarly, if p { cond(y’) then o(ap(f)) = an(f @ w') = ap(f)w'(p). Hence we 
may suppose that p | D, so by (11.2.5) we have ord,(N) > max{1, ord, cond(x )}. 
By Lemma 11.2.3, it follows that ap(f) = 0, and thus o (ap(f)) = ap(f)W(p). 

Now we prove the converse (<=), and suppose that conditions (i)—(iii) hold. Let 
M be the level of f @ w. Let Q denote the product of primes p | N such that 
either 


* p{M,or 
° ap(f) =Oandap(f @ yp) £0. 


Let € denote the trivial character modulo Q, and define 
[o,@) 
8(q) = > an(f @ W)E(n)q". (11.2.6) 
n=1 


We claim that conditions (i)-(ii) imply that g € S,(N, x’). By Atkin-Li [2, 
Proposition 3.1] it suffices to show that 


Icm(M, cond(ww’)Q, Q’) | N. (11.2.7) 
By Lemma 11.2.1(a) and the fact that cond(y) cond(y’) | N, we have 
cond(ww’) | M | Icm{N, cond(y) cond(w’)} = N, 
so to prove (11.2.7) it suffices to show that ord,(N) = 1-+max{1, ord, cond(w w’)} 
for all primes p | Q. 

Let p be such a prime. Then either p { M ora,(f) = ap(f) = 0 and a,(f ® 
wv) = a,(f ® w’) ¥ 0. In either case we must have p | ged(cond(y), cond(y’)) 
and, by Lemma 11.2.3, ord,(M) € {1, ord, cond(wy’)}. It follows that 

max{1, ord, cond(yw’), ord, (M)} < max{ord, cond(w), ord, cond(p’)}. 


Since p | gcd(cond(y), cond(w’)), we have min{ord, cond(y), ord cond(w’)} > 
1, and hence 
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ord, (cond(w) cond(w’)) > 1+ max{1, ord, cond(wy’), ord,(M)}. 
By Lemma 11.2.1(b) we have 
ord,(N) = ord, (cond(y) cond(w’)) > 1 + max{1, ord, cond(ww’)}. 


This concludes the proof that g € S;,(N, x w). 

Next, we claim that a,(g) = o(a,(f)) for all n < Sturm(k, NV). Since both 
sequences are multiplicative and xy y* = o(x), it suffices to verify this equality at 
primes, p. There are three cases to consider: 


° Ifpt{N thenap(f)v(p) = ap(f)W'(p), so that o (ap(f)) = ap(g). 
° Ifp|N anda,(f) = 0 then ay(g) = 0 by construction, and o(ap(f)) = 0. 
* Ifp| N andap(f) #0 then0 A o(ap(f)) € fap(PW(p), ap( fw (p)}. 


— Ifo(ap(f)) = ap(f)W(p) then p { cond(y), so ap(f)W(p) = ap(f ® ). 
— Ifo(ap(f)) = ap(f)'(p) then p { cond(y’), so 


ap (fw (p) = ap (f @W') =4,(f @W). 


In either case, we conclude that o (ap(f)) = ap(f ® W) = ap(g). 


By the Hecke—Sturm bound (Proposition 8.2.3), it follows that g = o(f). 
Finally, since f is a newform, o(f) is as well, and thus o(f) = g = f @y, 
by strong multiplicity one. Oo 


We conclude with a variant, similarly useful for algorithmic purposes. We recall 
the notion of distinguishing primes from Sect. 8.9. 


Theorem 11.2.8 With the same hypotheses as in Theorem 11.2.4, we have f@w = 
o(f) ifand only if all of the following conditions hold: 


Gi) cond(y) cond(xy) | N; 

(ii) xW? =o(x); 
(ili) o(ap(f)) = ap(f) (Pp) for all primes p < Sturm(k, N) with p { N; and 
(iv) o(ap(f)) = ap(f)W(P) for p in a set of distinguishing primes for f. 


Proof The implication (=) is clear, so we prove (<=). 

As in the proof of (<=) of Theorem 11.2.4, we again consider the form g as in 
(11.2.6) with € the trivial character modulo Q. Let Ng be the level of g. Then in 
the proof we showed that Ng | N andh := g—oa(f) € Sx(N, o(x)). By (iil) and 
Hecke recursion, we have a, (h) = 0 for alln < Sturm(k, N) coprime to NV. 

If Ng = N, then by (iv), we have o( f) = f @ w. So we may assume that Ng 
is a proper divisor of N. We now employ degeneracy operators to upgrade (iii). It is 
convenient to switch from lower-triangular to upper-triangular matrices. Let 


rN) := {y €SLi(Z):y = (: ") (mod | 
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and similarly [°(N), and define spaces of modular forms on these groups similarly. 
We refer to Diamond—Shurman [38, §5.7] for the results we need. The groups 
T',(N) and '!(N) are conjugate by the matrix (; ) giving an isomorphism 


tn = Se(T1(N)) > Sx(P!(N)) whose effect on Fourier expansions is Ye ong" 
yen bngy Where qy := exp(2miz/N). Moreover, this map preserves the Neben- 
typus character. For any d | N, the trace operator defines a map 


ma: Se(U'(N)) > Sea) S Se(P1(N)) 
where Tg := T'y(N)N T°(N/d): its effect on Fourier expansions is 


lee) [o@) 
> bag > ~ bngny- 


n=l n=1 
d\n 


The operator zg is a projection operator, and for d, d’ | N with gcd(d, d’) = 1 we 
have wgiq = qT. 


Consider 
h’ := |] — mph) € (PN), x). 
p|N 
By construction, multiplicativity, and (iii), we have a,(h’) = O for alln < 


Sturm(k, NV). Then by the Hecke—Sturm bound (Proposition 8.2.3), we conclude 
h’ = 0. Thus 


h(gy= > an(h)q”. (11.2.9) 


n=1 
gcd(n,N)A1 


We have realized h as a sum of oldforms. Turning this back to | (4), we conclude 
that 


h(q) = >> hp(q?) (1.2.10) 


p|N 


with hp(q) € Sx(Up,o(X)p), as in the oldform theory of Atkin—Lehner [1, 
Theorem 1] and Li [59, Corollary 1]; moreover, h, = 0 if and only if h is new 
at p. 

We now show that h = 0. Let p | N. If x is not a character modulo N/p, then 
Sk p, o(X) p) = 0 so hy = 0. So suppose x is a character modulo N/p. 


¢ Suppose that ap(f) #4 0. Then by Lemma 11.2.3, we have p || N. Thus 
ord,(cond(x)) = 0, so by (i) we have ordp(V) = 2ordp(cond(y)). If 
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ord,(cond(y)) = 0, then we have twisted by a character trivial at p, so 
ord,(M) = ord,(N) by Lemma 11.2.1(b). Therefore f @ wy is new at p, so 
g is new at p and a),(g) = ap(f @ W) soh, = O. If instead ord, (cond(w)) = 1, 
then p? | N, a contradiction. 

¢ Suppose ap)(f) = 0. If ap(f ® w) F 0, then by construction, ap(g) = 0 so by 
multiplicativity a,(f) = an(g) for all p | n; therefore hp, = 0. 


We have shown that o(f) = g. We then conclude as in the end of the proof of 
Theorem 11.2.4. Oo 


Example 11.2.11 Consider the space 24.2.f.a. There are two Galois-conjugate 
newforms with the same Nebentypus character. The Sturm bound is 8, but the 
smallest p { N where the Fourier coefficients differ is 11. In particular, this shows 
that in the Hecke—Sturm bound (Proposition 8.2.3) we cannot ignore the primes 
p\N. 


The virtue of Theorems 11.2.4 and 11.2.8 is that they give explicit criteria to 
certify inner twists, with care taken concerning primes dividing the level. 


11.3, Computing Inner Twists 


We used Theorem 11.2.8 to compute the complete group of inner twists for all the 
modular forms in our dataset. Specifically, we enumerate the finite set X of Dirichlet 
characters y satisfying condition (i) of Theorem 11.2.8 for which xy? is conjugate 
to x. Note that the set X does not depend on f/f or its coefficient field, only the 
character x and level N. We then determine the subset of X that satisfy conditions 
(iii) and (iv) for some o0 € Gal(K f) as follows: 


(1) We first remove from X all characters w for which there is a prime p < 
Sturm(k, NV) not dividing N such that ap(f)y(p) is not conjugate to ap(f); 
this is accomplished by comparing the minimal polynomials of ap(f)w(p) and 
ap(f)- _ 

For all remaining yy ¢€ X, set T := Gal(K yf) and for successive primes p < 
Sturm(k, N) with p { N, replace T with {o € T: a(ap(f)) = ap(f)v(P)}, 
stopping if T becomes empty. This yields a list of candidate inner twists (y, 7) 
containing InnTw(/). 

(3) Finally, for each candidate (w,o) we check whether (iv) holds; if so then 

Theorem 11.2.8 implies that (wy, o) is an inner twist of f. 


(2 


wm 


As shown by Example 11.2.11, the third step above is potentially necessary, but 
in our computation we never encountered a case where a candidate inner twist that 
survived step (2) was discarded in step (3). 


Remark 11.3.1 The Magma function InnerTwists implements a weaker form 
of Theorem 11.2.4. It requires checking eigenvalues up to the Sturm bound for 
level lem(N, cond(w)?, cond(y) cond(x )), and it performs eigenvalue comparisons 
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using complex approximations that do not guarantee a rigorous result. Indeed, even 
when the optional parameter Proof is set to True, Magma version 2.24—7 displays 
the following message: 

WARNING: Even if Proof is True, the program does not prove 


that every twist returned is in fact an inner twist (though 
they are up to precision 0.00001). 


12 Weight One 


Modular forms of weight one are of particular interest due to the connection with 
Artin representations, provided by a theorem of Deligne and Serre [36]: one can 
associate to each weight one newform f an odd irreducible two-dimensional Galois 
representations p¢: Gq — GL2(C) for which L(f,s) = L(pyz,s) (recall that a 
Galois representation is odd if complex conjugation has determinant — 1). Following 
the proof of Serre’s conjecture by Khare and Wintenberger [57], we now know 
that the map f +> pf is in fact a bijection. This connection allows one to attach 
several additional arithmetic invariants to weight one newforms that we would like 
to compute, including: 


* The projective image of o¢ in PGL2(C), which by Klein’s classification is isomor- 
phic to either D,, (dihedral of order 2n, including Dz := Z/2Z x Z/2Z), or one 
of the exceptional groups Aq (tetrahedral), S4 (octahedral), or As (icosahedral). 


° The projective field of py: the fixed field of the kernel of Ga mar GL2(C) — 
PGL(C). 

* The Artin image of p;: the finite group p s(Gq@) < GL2(C). 

¢ The Artin field of py: the fixed field of ker pf, with Galois group isomorphic to 
pf(GQ)- 


One can also consider the projective representation pr: Gg —> PGL»2(C) 
induced by p+ as an invariant in its own right: it uniquely determines the twist class 
of f. Two newforms f and g are said to be twist equivalent if g = f ® w for some 
Dirichlet character y, and in weight | this occurs if and only if p¢ = fg. 


12.1 Computational Observations 


The Deligne—Serre theorem also has important computational implications. In the 
typical case where p+ is a dihedral representation (meaning that its projective image 
is dihedral), the Artin L-function L(p+,s) is also the Weber L-function L(w, s) of 
aray class character w of the quadratic field K fixed by the preimage of C, C D, ~ 
Pf (Gq). (Forn = 2 there are three choices for Cz € D2; we can use any one of the 
three.) The quadratic field K and the ray class character w necessarily satisfy 
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|dx | Nm(cond(w)) = cond(pr) = N, (12.1.1) 


where dx is the discriminant of K and N is the level of f. In order to obtain an odd 
representation of we also require that if K is a real quadratic field then the modulus 
for w should include exactly one of the real places of K. 

For any given level N, it is straightforward to enumerate all quadratic fields K of 
discriminant dx | N, all Ox-ideals of absolute norm dividing N/|dx|, and all ray 
class characters w of K for the modulus with finite part J and infinite part compatible 
with an odd representation. This makes it feasible to explicitly compute Fourier 
expansions of all dihedral newforms of level N to any desired precision; to compute 
ap(f) for p { N this simply amounts to evaluating the corresponding ray class 
character w at the prime ideals of Ox above p. 

Pari/GP contains extensive support for computing with ray class characters that 
are particularly efficient in the case of quadratic fields. We used this to compute all 
dihedral newforms of level N < 40,000 with Fourier coefficients a,(f) computed 
for n < 6000 (well past the Sturm bound). This yielded a total of 572,462 
dihedral newforms, corresponding to 14,634,052 embedded newforms. The largest 
dimension we found was 2818, which arises for a dihedral newform of level 39473, 
and the largest projective image we found was D246 for a newform of level 39,851. 

These computations go far beyond the extent of our database described in 
Sect. 10.1, which only covers levels N < 4000 in weight one. For comparison, 
the largest dimension arising for N < 4000 is 232 and the largest projective image 
is Dzg5. The reason for this discrepancy is that while it is computationally very 
easy to compute dihedral newforms, to obtain a complete enumeration of all the 
newforms in a given weight one newspace, one must also enumerate the tetrahedral, 
octahedral, and icosahedral newforms, which is more difficult—particularly in the 
icosahedral case. Interestingly, the main difficulty often lies not in enumerating 
these exceptional newforms, but in verifying that one has actually found them all. 
In contrast to the case k > | where there are well known dimension formulas, while 
there are computational tricks that work well in special cases, to our knowledge no 
efficient method for computing dim S]°“(N) for general N is currently known. 


12.2 Classifying the Projective Image 


The Pari/GP function mEgaloistype can be used to classify the projective image, 
but given that we actually computed the projective field in every case (which of 
course determines the projective image), we did not exploit this feature. 


Remark 12.2.1 Buzzard—Lauder [19] describe an approach to classifying the 
projective image by computing projective orders of elements that they applied to 
all weight one newforms of level up to 1500. They note in their paper that their 
approach relies on the convenient fact that there are no weight one newforms of 
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level N < 1500 with projective image A4 whose coefficient field contains Q(¥/5). 
Five such examples arise in our dataset, the first of which is 2299.1.w.a. 


12.3. Computing the Projective Field 


Our strategy for computing the projective field is to exhaustively compute a 
complete set of candidates and then rule out all but one. As noted in Sect. 12.1, we 
can effectively determine all the dihedral forms at each level, so we know in advance 
exactly which forms are dihedral (and the exact order of the projective image in each 
of these cases). In cases where a dihedral image has moderate degree—less than 
100, say—it is feasible to use the ray class field functionality in Pari/GP to compute 
the projective field. This notably includes all of the dihedral projective fields whose 
distinguished quadratic subfield is real: the largest such example in our database is 
2605.1.bd.a with projective image Do. 

The dihedral fields in which the distinguished subfield is imaginary quadratic can 
be much larger: the largest example 3997.1.cz.a has projective image Dogs. In these 
cases, we exploit the fact that every dihedral field whose distinguished quadratic 
subfield is imaginary can be realized as a subfield of a ring class field that can be 
explicitly computed using the theory of complex multiplication. There is a well- 
developed theory for efficiently computing these ring class fields, even in cases 
where the degree may be in the millions, motivated by applications to cryptography 
and elliptic curve primality proving (the CM method for constructing elliptic curves 
over finite fields). 

Given a dihedral weight one newform f € S}*“(N, x) with dihedral image 
D,, and distinguished imaginary quadratic field K, there is a finite set of possible 
suborders O of Ox and conductors c such that the projective field of f arises as a 
cyclic degree-n extension of K of conductor c contained in the ring class field K of 
O. The enumeration of these dihedral fields was achieved using an algorithm based 
on the techniques developed by Enge-Sutherland [43] and Sutherland [92, 93] that 
will be described in a forthcoming paper. 

Having enumerated a complete list of candidate fields L := Q[x]/(g_(x)), for 
successive primes p { N we can compute the order of pf(Frob,) in PGL;(C) 
by determining the positive integer n for which ap(f 2/x(pP) = in + on 142 
and compare this to the inertia degree of the primes above p in O,. This will 
eventually eliminate all but one candidate field, since the sequence of inertia degrees 
uniquely determines a Galois number field, and in practice this happens very 
quickly. To accelerate the computation we precompute defining polynomials for the 
real cyclotomic fields we may encounter and use p coprime to the discriminants of 
the defining polynomials gz; so that we can compute the inertia degree as the degree 
of the irreducible factors of g7 (x) in Fp[x]. 

For the non-dihedral projective images we used the methods of Cohen—Diaz 
y Diaz—Olivier [22, 23] to enumerate all A4 and Sy fields unramified outside a 
given set of primes, and for the As fields we used existing tables of fields in 
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the Jones—Roberts database and the LMFDB combined with a targeted Hunter 
search for some missing cases, as described by Jones—Roberts [52]. This allowed 
us to construct complete lists of candidate fields for each non-dihedral weight one 
form from which we then ruled out all but one candidate by comparing orders of 
Frobenius elements with inertia degrees as described above. 


12.4 Computing the Artin Image, the Artin Field, and the 
Associated Artin Representation 


As of January 2020 the LMFDB contained 5116 odd two-dimensional Artin repre- 
sentations of conductor N < 4000, all of which we were able to uniquely match to 
a corresponding newform of weight one. For each of these Artin representations the 
LMFDB provides the Artin image, the Artin field, and a complete description of the 
Artin representation given values on each conjugacy class of Frobenius elements. 
We were also able to compute the Artin image and Artin field for 833 additional 
weight one newforms that are twists of a weight one newform for which we know the 
corresponding Artin representation by taking the compositum of the known Artin 
field with an appropriate cyclotomic field. 

There is work in progress to add as many of the Artin representations correspond- 
ing to the remaining 14,190 weight one newforms as possible; these will be linked 
to the corresponding weight 1 newforms as they become available. 


12.5 Interesting and Extreme Behavior 


Weight one modular forms behave rather differently than those of higher weight. As 
seen in Sect. 12, one important invariant of weight one forms is the projective image 
of the associated Galois representation. We will discuss some forms with dihedral 
projective image first. 

Hecke also constructed weight one modular forms starting from imaginary 
quadratic fields with odd class number at least 3. The first examples of such fields 
come from Q(./—23), Q(/—31), Q/—39), and the corresponding modular forms 
are the three smallest level weight one newforms; these have labels 23.1.b.a, 31.1.b.a 
and 39.1.d.a, respectively. [46] 


Example 12.5.1 The last of these, 39.1.d.a, is the D2 form of lowest level and has 


CM by both Q(./—3) and Q(./—39), and RM by Q(vV 13). This form appears in 
work of Darmon—Lauder—Rotger [34, Example 2.5]. 


The first examples of newforms with RM but no CM occur in level 145 with 
145.1.f.a (RM by Q(v5), [34, Example 3.3], [33, Example 4.1]) and 145.1.h.a (RM 
by Q(V 29), [35, Example 1.2]). 
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The problem of constructing weight one forms whose projective image is not 
dihedral was considered by Tate and Serre in the 1970s. These forms are sometimes 
called non-banal or exotic. Such forms divide up into 3 cases based on their 
projective image, which can be one of A4, S4, As: the forms are then known as 
tetrahedral, octahedral and icosahedral, respectively. 

Tate together with his students, Flath, Kottwitz, Tunnell, and Weisinger, and 
additionally Atkin, exhibited a form of level 133, with projective image A4 
described in a letter to Atkin [94, p. 713]; this form is 133.1.m.a in our database. 
The smallest level example is actually in level 124, given by 124.1.i.a. 

In the octahedral case, the smallest level example is in level 4-37 = 148 
with label 148.1.f.a; this newform is discussed by Buzzard [18, §2.3] and Darmon— 
Lauder-Rotger [33, Example 5.6]. 

Many modular forms previously considered in the literature with interesting 
Galois representations can now be found in our database. Ogasawara [69] takes 
the mod-3 Galois representations attached to certain elliptic curves and constructs a 
GL2(F3) Artin representation: for example, the elliptic curve of conductor 11 with 
label 11.a3 is used and the corresponding octahedral modular form of weight one 
over Q(,/—2) is constructed. Using the g-expansion coefficients given there, we can 
use the trace search functionality to locate a (unique) matching form in our database: 
3267.1.b.d. We then verify that it has the right Artin field: a degree 8 extension over 
which 11.a3 gains 3-torsion. 

Buhler [13, 14] constructs the icosahedral Galois representation of level 800, 
labeled 800.1.bh.a. Kiming—Wang [55] gave several more instances of icosahedral 
newforms of weight one with characters of order 2, showing their existence in order 
to verify the Artin conjecture in these cases. The new database now contains all but 
one of these: 2083.1.b.b, 1948.1.b.a, 3004.1.b.a, 3548.1.d.a, 3676.1.c.a, 2336.1.c (two 
newforms). The only newspace discussed in loc. cit. with level outside our range 
would have label 6176.1.b. The database also contains the icosahedral newforms 
1376.1.r.a, 2416.1.p.a, 3184.1.t.a, 3556.1.ba.a and 3756.1.q.b which were all shown 
to satisfy Artin’s conjecture by Buzzard—Stein [20]. The proof of Serre’s conjecture 
[57] established Artin’s conjecture for all odd irreducible two-dimensional repre- 
sentations, including all of the icosahedral cases. The smallest level example of an 
icosahedral newform is 633.1.m.b. 

Constructing exotic forms of prime level with specific projective image is also a 
much studied problem. Such forms do not exist in the tetrahedral case [83, Thm. 7, 
p. 245], leaving only octahedral and icosahedral forms with the possibility of prime 
level. 

In the octahedral case the smallest prime level is 229, and the space of newforms 
229.1.d splits into two Galois orbits, (see Serre [83, p. 265]). The second smallest 
level is 283, where we have the newform 283.1.b.b that appears also in work of Serre 
[84]. 

In the icosahedral case, we have seen above the first example of such a form: the 
one with level 2083 of Kiming—Wang. In fact the query for forms with projective 
image As shows that there are 4 such forms with prime level < 4000: 2083.1.b.b, 
2707.1.b.b, 3203.1.b.a, 3547.1.b.c. It is conjectured that these forms are rare. 
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Conjecture 12.5.2. For any € > 0, the number of exotic newforms of prime level N 
is Oc (NS). 


Bhargava—Ghate [5] have shown an averaged version of this conjecture in the 


octahedral case. 
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Alex J. Best and Benjamin Matschke 


Abstract We present a database of rational elliptic curves, up to Q-isomorphism, 
with good reduction outside {2, 3, 5, 7, 11, 13}. We provide a heuristic involving the 
abc and BSD conjectures that the database is likely to be the complete set of such 
curves. Moreover, proving completeness likely needs only more computation time 
to conclude. We present data on the distribution of various quantities associated 
to curves in the set. We also discuss the connection to S-unit equations and the 
existence of rational elliptic curves with maximal conductor. 


1 Introduction 


Databases or tables of all elliptic curves subject to various constraints have been 
published since the 1970s, including in the well known Antwerp IV conference 
proceedings [6]. Such tables are useful both in identifying a given curve appearing 
in nature, or for proving a curve with certain properties does not exist. Tables can 
also be used to answer distributional questions about properties of elliptic curves 
when ordered in different ways. 

The most well known such tables are those of elliptic curves over Q with 
bounded conductor due to Cremona [12, 11], which now form part of the 
LMFDB [26]. 

One may instead, however, construct tables of elliptic curves with bad reduction 
only at primes in a specified set S of rational primes. These are exactly the primes 
dividing the conductor. Organising curves by their primes of bad reduction can be 
quite useful in practise; it is often possible to prove a particular curve has good 
reduction outside certain places, and then conclude that the curve is contained in 
such a table for some S. 


A. J. Best - B. Matschke (J) 
Department of Mathematics and Statistics, Boston University, Boston, MA, USA 
e-mail: matschke @bu.edu 


© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021 215 
J. S. Balakrishnan et al. (eds.), Arithmetic Geometry, Number Theory, and 
Computation, Simons Symposia, https://doi.org/10.1007/978-3-030-809 14-0_5 


216 A. J. Best and B. Matschke 


In particular many classical diophantine equations can be phrased in terms of 
the existence of elliptic curves with specified places of bad reduction, see Sects. 4.1 
and 4.2. 

In this paper we compute and study what is conjecturally the complete set of 
isomorphism classes of elliptic curves over Q with good reduction away from the 
first six primes {2, 3,5, 7, 11, 13}. This set and the code and auxiliary data used to 
compute it (including Mordell—Weil bases for almost 100,000 Mordell curves) are 
available at 

https://github.com/elliptic-curve-data/ec-data-S6. 

Many of the curves in this set have quite large conductor, but nevertheless, 
by virtue of having bad reduction at only a few small primes, can be simpler 
arithmetically than other curves with smaller conductor. 


History We now give a non-exhaustive overview of previous work computing 
databases of elliptic curves over Q. 

In the late 1980’s, Brumer and McGuinness [8] computed rational elliptic curves 
of prime discriminant A bounded by |A| < 10%. Stein and Watkins [36] then 
extended this database to include almost all curves up to |A| < 10!? with either 
conductor N < 10° or prime conductor less than 10!°, 

To compute the set of elliptic curves with bounded conductor, Tingley [40] used 
modular symbols to find all elliptic curves with N < 200. This was greatly extended 
and improved by Cremona [12, 11], who has currently computed all of these curves 
up to N < 500000. Initially this approach was only known to compute modular 
elliptic curves, and it was only when modularity was proved that it was confirmed [7] 
that over Q being modular is not a restriction. 


A third natural way to construct a database of elliptic curves is by restricting the 
set of places of bad reduction, i.e. the primes that divide N (or equivalently, primes 
that divide the minimal discriminant). For any finite set of rational primes S, let 
M(S) denote the finite set of elliptic curves over Q with good reduction outside 
of S, up to Q-isomorphism, and let 


Ns:= |]. 


pes 


We may then hope to compute the set M(S) for various sets S. 

The set M({2,3}) was computed by Coghlan [9] and Stephens [37], and 
Coghlan’s data was republished as Table 4 in [6]. Agrawal, Coates, Hunt and van 
der Poorten [1] computed M({11}) via a reduction to Thue—Mahler equations. 
Cremona and Lingham [13] computed M({2, p}) for p < 23 via a reduction 
to the computation of S-integral points on Mordell curves. Koutsianas [25] used 
a reduction to S-unit equations over number fields to compute M({2, 3, 23}), 
as well as curves E € M(S) for various other S satisfying certain restrictions 
on the 2-division field of E. Von Kénel and the second author [22] computed 
M({2, 3,5, 7, 11}) as well as all M(S) with Ns < 1000 using an elliptic logarithm 
sieve to compute S-integral points on elliptic curves. Bennett and Rechnitzer [4] 
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and Bennett, Gherga and Rechnitzer [3] computed M({p}) for all p < 500000 
using a refinement of the reduction to Thue—Mahler equations and Thue equations. 
The latter paper also recomputes M({2, 3,5, 7, 11}) using this approach. Moreover, 
using a heuristic they computed all curves in M({p}) for p < 10!°, without 
guaranteeing completeness. 

Finally we mention that there are various extensions to the above methods 
to compute elliptic curves over number fields with good reduction outside of a 
given set of places. In particular the aforementioned approaches of Cremona and 
Lingham [13] and of Koutsianas [25] generalise to the number field setting. 


Outline The aim of this paper is to compute the set M({2, 3,5, 7, 11, 13}). We 
have computed a subset of this that is heuristically the full set, but is not proved to 
be complete by our method at present.! In Sects. 1.1 and 1.2 we give a summary of 
our data and discuss some statistics of the data. We compare our data to Cremona’s 
database in Sect. 1.3. 

Our computation relies on a reduction to solving Mordell equations in S-integers; 
this is discussed in Sect. 2.1. The main computational bottleneck is computing the 
Mordell—Weil bases of a large set of Mordell curves; this is elaborated upon in 
Sect. 2.2. 

In Sects. 2.3 and 2.4, we discuss a heuristic that our database should be complete 
and the possibility of proving completeness via additional computation. In Sect. 3, 
we show some results suggested by the data regarding the question for which sets 
S there are elliptic curves with good reduction outside S of maximal possible 
conductor. In Sect.4, we discuss connections and applications to solving other 
classical diophantine equations including S-unit, Thue—Mahler and Ramanujan— 
Nagell equations. 


1.1 Summary of the Database 


Let S(n) denote the set of the first n rational primes. According to our computation, 
the set M(.S(6)) contains 4576128 curves in total; see Table 1. Here, j(M(S(7))) 
is the set of distinct j-invariants of curves in M(S(n)); the cardinality of this set is 
therefore the number of Q-isomorphism classes of curves in M(S(n)). 

When n > 2 we can obtain all of M(S(n)) by taking a representative of each 
Q-isomorphism class of curves in M(S(n)) and twisting this representative by all 
integers divisible only by primes in S(n). For j 4 0, 1728, we only have quadratic 
twists. When j = 1728 we have quartic twists, and for 7 = O sextic twists (our 
assumption that n > 2 implies that 0, 1728 € j(M(S(n)))), giving the equation 


#M(S(n)) = 2"*!(#j(M(S(n))) —2)+2-4"4+2-6". 


‘However, work in progress by the second author gives the same set of curves using a different 
method. 
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Table 1 Numbers of elliptic curves with good reduction outside S(n) up to Q-isomorphism and 
up to Q-isomorphism. The asterisk refers to the possible incompleteness of this paper’s table. The 
case n = 0 is the classical result that there is no elliptic curve over Q with everywhere good 
reduction 


n #M(S(n)) #](M(S(n))) _ 
0 0 0 Tate (cf. Ogg [29]) 
1 24 5 [9,. 37,29] 

2 752 83 (9, 37] 

3 7600 442 [22] 

4 71,520 2140 [22] 

5 592,192 8980 [22, 3] 

6 4,576, 128* 34,960* This paper 


This holds in all cases above and provides a quick check that nothing that obviously 
should have been in the database has been missed. 

Each curve in M(S(6)) has conductor N | 28355277 117137, which gives, a priori, 
4374 possibilities for NV. It turns out that exactly 4344 of them are indeed attained by 
curves in our set. The 30 exceptions for which there is no curve with that conductor 
are 


{1, 2,3, 4,5, 6, 7, 8,9, 10, 12, 13, 16, 18, 22, 25, 28, 60, 81, 165, 
169, 351, 945, 1280, 1820, 2673, 2816, 9984, 13365, 362880}. 


These exceptions factor as follows: 


(ya a 5265.9 8 SO 7G, 19: 9 9 1S 7 5, 
BY 9s 54 11, 13", 3 13.9? «50 7," 5 5 7 138, 2 11, 2 1, 
hae anc ae lems es Agere er ee 


These (non-)conductors are all within the range of Cremona’s database, and we can 
therefore check that there are indeed no elliptic curves with any of these numbers 
as their conductor. We note that the largest conductor for which no elliptic curve of 
that conductor exists is less than the square root of the largest possible conductor of 
a curve in M(S(6)). 

Next we consider isogeny classes in M(S(6)). This is also a natural partition 
of curves in the database as M(S(n)) is closed under taking isogenies (any two 
isogenous curves have the same conductor). Our data contains 3688192 disjoint 
isogeny classes in total: 2966912 classes of cardinality one, 646784 of cardinal- 
ity two, 4608 of cardinality three, 60928 of cardinality five, 6784 of cardinality six, 
2176 of cardinality eight, and no others. An example of a curve in M(S(6)) with 
isogeny class of cardinality 8 is 
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y* = x3 + 827614112325 x + 276113445805174250. 


Edgar Costa has computed the analytic ranks of all curves in our table, as 
well as the leading coefficients and root numbers of the associated L-series. His 
computations use interval arithmetic, and hence the leading coefficients are given 
with exact error bounds. The standard problem that remains is that it is impossible to 
verify numerically that the lower derivatives vanish exactly, and thus the computed 
analytic rank is actually only an upper bound once the rank is large enough. 
According to his computations, there are 1884428 curves of analytic rank zero in 
our data, 2267261 of analytic rank one, 406309 of analytic rank two, 18003 curves 
of analytic rank three, and the remaining 127 curves are of analytic rank four. We 
can compare this to the number of rational elliptic curves with conductor bound 
N < 500000 of each rank using Cremona’s database: for these curves, Cremona 
computed analytic and algebraic ranks (and checked that they coincide), and found 
that there are 1632686 curves of rank zero, 2124004 of rank one, 461670 of rank 
two, 11243 of rank three, and 1 of rank four. In both tables, we observe, similarly, 
a larger number of rank 1 curves than rank 0 curves. An intriguing difference is the 
larger number of rank 4 curves in our data, compared to a similar total number of 
curves when ordered by conductor. 


1.2 Distribution of Quantities 


In this section we study the distribution of various arithmetical quantities associated 
to curves in our dataset. As these curves have bad reduction at only the first 
six primes, they are quite structured, and it is interesting to compare answers to 
distributional questions to when curves are ordered with respect to conductor or 
discriminant. 

One fundamental quantity is the conductor. We plot the distribution of the 
logarithm of the conductor for the curves in our database as a histogram in Fig. la. 
We take the logarithm of N due to the multiplicative nature of the conductor. Indeed, 
if the conductor exponents f, in N = [| pes p/» were uniformly and independently 
distributed (which they are not), then in Fig. la we would see an approximately 
normal distribution with mean 14.037 and standard deviation 4.382. The observed 
distribution of log(NV) is comparatively lopsided: It appears denser in the larger 
conductor range. This could be explained by the fact that one can turn good into 
additive reduction at p > 3 via twisting by p (as the reduction of F at p will have 
Kodaira symbol Kj by Tate’s algorithm), without leaving M(S(6)). 

The Szpiro ratio of an elliptic curve over { is defined to be the ratio 


log |Ag| 
o = —— 
log N 
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(a) Elliptic curves in our data set. (b) Elliptic curves with N < 500000. 


Fig. 1 Histograms of logarithms of conductors: (a) shows the curves we computed 
within M(S(6)). For a comparison, (b) shows all rational elliptic curves with N < 500000 
according to Cremona’s database. The bar at log(500000) ~ 13.1 signifies the end of the overlap 
of both tables 
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(a) Elliptic curves in M(S(6)). (b) Elliptic curves with N < 500000. 


Fig. 2 Histograms of Szpiro ratios 0 = log(Ag)/log(N). (a) shows the curves we computed 
within M(S(6)). For a comparison, (b) shows all rational elliptic curves with N < 500000 
according to Cremona’s database. We observe three differences: the larger maximal value for o 
in (b) (namely 8.903700), the larger mean for o in (a), and (b) contains a significant number of 
curves with o = | (namely 602) 


of the logarithms of the minimal discriminant of the curve and its conductor. 
Figure 2a sketches the distribution of Szpiro ratios for the curves in our database. 
Szpiro’s conjecture states that |A¢| = O,(N°t®), or equivalently, that for any 5 > 0 
there are only finitely many elliptic curves over Q witha > 6+ 6. 

Indeed, the largest Szpiro ratios occurring among all curves in our dataset are 
approximately 


8.757316, 8.371586, 8.11481 and 8.034917.... 
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The curves for which these ratios occur are all in the LMFDB and have labels 
858.k2, 2574.j2, 910.e1, and 9438.m2 respectively. The second and fourth of these 
are both quadratic twists of the first. It seems that these three have a large Szpiro 
ratio due to a factor of 3*! in each of their discriminants. The third has a factor of 
2° in its discriminant. These are the only four curves in our set with o > 8. There 
are 123 curves in the database with o > 7, only 15 of which have conductor larger 
than 500000. The largest o from our data with N > 500000 has N = 532350 and 
o © 7.161459. 


1.3, Comparison with Cremona’s Database 


Cremona [12, 11] has computed the set of all rational elliptic curves with conductor 
less than various bounds, currently up to N < 500,000. 

If S$ is the set of primes of bad reduction of an elliptic curve E of conductor N, 
then 


Ns < N < 1728N5. 


Thus, in principle, the problems of computing M(S) and all curves of bounded 
conductor are equivalent. Both parameters S and WN stratify the infinite set of rational 
elliptic curves. In practise, however, these stratifications differ considerably: for 
example, M(S(6)) contains 14,216 curves of conductor 2999527711713" = 10%, 
which is considerably larger than 500,000; and on the other hand, M(S(6)) does not 
contain the four curves with conductor 17. 

Cremona’s database contains at present 1,238,682 distinct j-invariants, whereas 
the computation we performed resulted in 34,960, because for each j-invariant, our 
set contains at least 128 distinct twists. On the other hand, Cremona’s database con- 
tains 3,064,705 Q-isomorphism classes of curves, whereas ours contains 4,576,128. 
Despite the fact that the two databases contain more or less the same number of 
curves, there are 4,376,070 curves in our set not contained in the Cremona database, 
that is, less than 5% of our set overlaps with his. We observe significant differences 
in the distributions of log(V) and of o for both data sets, see Figs. 1 and 2. 

Cremona’s tables contain more information about each curve present there than 
our tables currently do, including Manin constants, generators for the Mordell—Weil 
group, BSD invariants, modular degrees, optimality data, sets of integral points and 
images of Galois representations. Much of this data would be prohibitively difficult 
to compute for every curve in our set due in part to the size of the conductors of 
some of the curves in our table. 
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2 Computation 


In this section, we discuss the reduction of computing M(S) to the problem of 
solving Mordell equations, the computation of the requisite Mordell—Weil bases, 
which is then the dominant computational task to be undertaken, and the heuristic 
completeness of the obtained data. 

The code implementing the methods described here and computed data are avail- 
able online. The repository https://github.com/elliptic-curve-data/ec-data-S6 con- 
tains the majority of the code, and the file mordell.sage of https://github.com/ 
bmatschke/solving-classical-diophantine-equations/ contains an implementation of 
the algorithm of von Kanel and the second author [22]. 


2.1 Computation Method 


Let S denote a finite set of rational primes, let M(S) denote the set of elliptic curves 
over Q with good reduction outside of S, up to Q-isomorphism, and let 


Ns := [[p. 


pes 


For this section, we assume that 2,3 € S, which can be achieved by enlarging S 
if necessary. Let Os = Z[1/Ns] denote the ring of S-integers and O% the group of 
S-units. 

A theorem of Shafarevich [31] states that for any such S, the set of curves M(S) is 
finite. This can be seen as follows: for any E € M(S), choose a minimal Weierstrass 
model for E and consider the cq and c6 invariants and discriminant A, of this 
model. These invariants satisfy the equation c = pe — 1728A¢g and Ag € ZNO. 
If necessary, we may divide this equation by a power of p® for each p € S to obtain 
an equality of the form Y = X? + a, where X,Y € Os anda = + ITpes p®? with 
0 < ey <5(p € 5S). The pair (X, Y) can then be regarded as an S-integral point on 
the Mordell curve 


Eg: Par eE, 


By a theorem of Siegel [32, 33], the set Ez (Os) is finite. From any point in E,(Os), 
we can recover potential invariants cq and c¢ that produce the point, up to any factors 
of p® in cc and ee for p € S. This recovers E up to a quadratic twist by a positive 
S-unit. Moreover, there are exactly 2!5| such twists. 

We deduce that M(S) is finite, and its computation reduces to the computation 
of E,(Os) for finitely many values of a. To determine E,(Os), we use the algorithm 
of von Kénel and the second author [22], who gave a method to compute S- 
integral points on rational elliptic curves E provided that generators of the free part 
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of E(Q) are known. Their implementation uses an elliptic logarithm sieve, which 
can compute E,(Os) in quite an efficient manner. Thus to compute M(S), it turns 
out that computing the necessary Mordell—Weil bases of 2 - 6!5! Mordell curves is 
the computational bottleneck. In Sect. 2.2 we discuss this in detail. 


2.2 Computing Mordell-Weil Bases 


We have carried out the approach outlined above for S = S$(6). We now discuss the 
most computationally intensive part of the process, which is finding the generators 
of the free part of the Mordell—Weil group for a number of Mordell curves, many of 
which have large discriminant. We will use the term Mordell—Weil basis to refer 
to these generators. Note that finding the generators of the torsion subgroup is 
both computationally easier and completely classified for Mordell curves [19], so 
henceforth we assume it is known. 
The curves we consider are those with 


a & {4202357711133 0< e, <5}, (2.1) 


giving us 93,312 curves for which we wish to find the Mordell—Weil bases. 
We can reduce the number of curves that we need to consider using the following 
fact. 


Lemma 2.2 All Mordell curves have a 3-isogeny given by 


y? =x+a> y? =x>-27a (2.3) 
2 2 
3 -9 
(x,y) ¢ amie ae *) (2.4) 
x x 


As the composition of two such isogenies is an isomorphism between two models 
of the same curve, these 3-isogenies partition our set of Mordell curves into pairs. 
The upshot is that if we can find generators of the Mordell—Weil group of one of 
each pair, we can easily find generators for the other by pushing the basis forward 
along the isogeny and saturating, if necessary. Using this we need only compute the 
Mordell—Weil bases of half the curves, and we may choose which of each pair to 
consider. 


2.2.1 Standard Techniques 


Out of the 93,312 Mordell curves, we have computed what should be the analytic 
rank of those with positive a using Pari/GP’s ellanalyticrank [39], via 
Sage [16]. Using the above isogeny, the other half of the curves will have the same 
Mordell—Weil rank. Of these curves, 20,215 have analytic rank zero, 23,186 have 
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analytic rank one, 3112 have analytic rank two, 142 have analytic rank three, and 
only | curve has analytic rank four which is 


y* = x? + 82063881900. 


We assume that the output of ellanalyticrank is correct and that the 
analytic ranks are as stated above. As Pari does not use interval arithmetic, it is 
not clear to what extent these computations are guaranteed to be correct (especially 
for the high rank cases). As we shall see below we have found as many generators as 
there should be for almost all curves. For many of the curves, once a set of generators 
is found, descent techniques can be used to prove that the algebraic rank equals what 
is implied by BSD and that the set of generators is complete. 

By the work of Gross—Zagier [20] and Kolyvagin [24], it is known that analytic 
rank < | implies the rank equals the analytic rank. Therefore no further computation 
is required for the analytic rank 0 curves above. For the analytic rank | curves, we 
need only to find a single non-torsion point which we can then saturate to find a 
basis. 

For many rank | and 2 curves in the set and for all curves of rank at least 3, a 
combination of the built-in Magma and Sage functions and a few other techniques 
summarized below sufficed to compute the Mordell—Weil bases. These included two 
and four-descents methods, point-searching with Stoll’s ratpoints program [38] 
and Simon’s e119 [35] to search for points in some instances. 

For the curves of rank at least 2, sometimes it was only possible to find a subset 
of a set of generators on each curve of each three-isogenous pair. However in this 
case, it was often possible to map one set of generators via the isogeny to the other 
curve, and combine the generators to give a basis for the Mordell—Weil group of 
one (and therefore both) curves. This happened mostly when the height of the found 
generators grew when mapped to the isogenous curve. 

In rank 1, Heegner points are available in addition to the other machinery 
of point-searching and descent [10]. In theory, computing a Heegner point is 
guaranteed to terminate and if the found point is non-torsion, then it is known that 
the curve has algebraic rank 1. However, in order to compute Heegner points, we 
need to find the images of points under the modular parameterization, and hence we 
may need to compute a large number of Frobenius eigenvalues to find the image to 
a large enough precision in order to recover an algebraic point. 

Using a combination of all of these techniques, we found bases for all curves 
except for 16481 of the rank 1 curves, and we found a single generator (but not 
the full basis) for all but 33 of the rank 2 curves. There was one additional rank 2 
curve for which we did not find any infinite-order points with these methods (E, 
fora = 2-3-5-7-11*- 13). It is likely that a part of the rank 1 cases would be 
amenable to the techniques mentioned, by using larger search bounds or more time 
or memory. However it seemed a different approach was needed to find bases of the 
hard rank 1 curves as well as all remaining rank 2 curves. 
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2.2.2 12-Descent 


To determine the generators on these harder curves, we used the 12-descent routine 
in Magma designed and implemented by Fisher [17]. This works by combining a 
3-cover obtained from a 3-descent procedure with a 4-cover from doing 2-descent 
and then 4-descent. In our setting, the presence of a 3-isogeny for all of our target 
curves allows us to use 3-descent by isogeny to obtain the 3-cover. This is more 
efficient, as the number fields involved are smaller than those involved in a general 
3-descent. The implementation for this in Magma is due to Creutz. 

Fisher’s algorithm then determines a 12-cover and a map to the original curve 
from each pair of one 3-cover and one 4-cover coming from these lower descents. 
Therefore to find a generator of the Mordell curve, we loop over all 4-covers and 3- 
covers of the curve coming from descent and search for points on the corresponding 
12-cover. It is expected that if an n-cover has small enough coefficients, the height 
of a preimage of a point of height is roughly h/2n. Therefore, given an estimate of 
the canonical height of a generator of the Mordell curve (coming from the regulator 
estimated via BSD) and a bound for the difference of the naive and canonical heights 
on an elliptic curve (such as [28]), we can search for points on the cover which 
should map to a generator. Because this point should have smaller height, this should 
substantially reduce the time needed to search for points, compared with simply 
searching on the original curve. Using this, we reduce the bound for the height 
search by a factor of up to 24 if the coefficients of the 12-cover are not too large. 
To search for points on the 12-covers, we use the Magma method Point Search, 
implemented by Watkins [41]; see also [43]. This approach has previously been used 
to find generators of large height on single Mordell curves [21]. 

Due to the fact that we do not know the order | III] of the Shafarevich-Tate group 
for our curves, the regulator may give an overestimate for the height of a generator, 
as BSD will only allow us to determine ./|II]- R from readily available information. 

This procedure was carried out with increasing timeout, up to a maximum 
of 12 hours, and was broadly successful in finding a generator of the rank 1 and 2 
curves for which more standard methods failed. 


2.2.3 Remaining Curves 


The combination of these methods has generally sufficed to determine the Mordell— 
Weil bases of the necessary Mordell curves. However, there are 306 rank | curves 
remaining (up to the 3-isogeny above), for which we have so far been unable to 
find the Mordell—Weil bases.* A combination of large conductor and large regulator 


2 Allan MacLeod has communicated to us the Mordell—Weil bases of 10 more of these curves, 
using his own implementations of similar techniques to those outlined above. Nevertheless a fast 
general method to find bases for all of our curves remains elusive. 
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(and hence either large generator height or large |LII|) has prevented any of the 
above methods from working in a reasonable time frame. 

The Mordell curve with smallest regulator for which we do not know a 
generator is 


y* = x? + 730033053750, 


with regulator approximately 167.305352. 
The largest regulators occurring for the remaining curves arise for 


y? = x? + 904509009004500900000, 


which interestingly are quadratic twists of each other (by —1). Their regulators are 
17,550.10 in the + case and 17,628.52 in the — case. However, these curves are 
somewhat exceptional; not all curves are quite so large. The mean of the remaining 
regulators is 2622.49. 

To attack the remaining curves, several options exist to compute a generator. We 
have explored these for a few of the remaining curves that we expect to be “easier”. 

We attempted to make use of Magma’s HeegnerPoint method. As described 
in Watkins [42], this allows the user to use 4-descent to construct a 4-cover of the 
target elliptic curve and then find a Heegner point on the cover, reducing the required 
precision needed and hence the number of required Frobenius traces. Unfortunately, 
this Magma method fails on many of our difficult examples, presumably because 
both the conductor of the curve and the height of the Heegner point are both 
large enough that the number of Frobenius traces needed becomes unwieldy for 
Magma. Due to the closed-source nature of Magma (and the HPInternal2 
and FrobeniusTracesDirect methods in particular) we have been unable to 
rectify these problems. It is also unclear whether or not Magma’s algorithm for 
computing all traces of Frobenius for primes below a given bound for one of our 
curves is optimal. As our curves are Mordell curves they have CM (by /—3); 
therefore, to compute the Frobenius traces, we may make use of Cornacchia’s 
algorithm [10, pp. 597]. 

The highly optimised smalljac package [23] (available from Sutherland’s 
webpage) includes an implementation of this algorithm in the case of j-invariant 0, 
and we expect that using this will be the most effective way to compute enough 
Frobenius traces to find a Heegner point on the remaining curves. 

Happily, Pari/GP’s ellheegner method is more reliable on our examples, 
though it does appear to use the covering method. Thus we expect that it will 
successfully find a generator on several of the remaining curves given enough time. 
It is not clear that Cornacchia’s algorithm is used to compute all of the Frobenius 
traces in Pari. 

For instance, this Pari function has returned a point successfully for one of the 
“missing” curves, we have found a point of 


y? = x3 + 4259854045547100000 
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of height 956.2822, and it is possible this case was more tractable due to the fact 
that this is really the double of a generator which has height 239.07055 instead. As 
we are not actually missing a generator on this curve, we may check that indeed it 
does not give any extra elements of M(S(6)). However, as it took far longer to find 
this point, and it required more interactive experimentation with parameters than 
the descent methods that we used for the vast majority of the curves, we prefer to 
present it separately from the main data. 

In theory, with an increased height bound for point-searching on 12-covers and 
with enough time, a point should be found on such a cover in the same way as we 
found the above. There are two potential issues with this. Firstly a lattice reduction 
algorithm is used in the point-search procedure. It often happens that this method 
gets stuck if these lattices happen to be ill-conditioned for Magma’s algorithm. This 
can stall the point search, and we are not aware of the true cause or of ways of 
avoiding this other than restarting and hoping to get lucky. The second is that the 
coefficients of the 12-cover can be quite large, which can reduce the effectiveness 
of the height-saving of the algorithm. Thus it is very important to minimise the 12- 
cover, as described by Fisher, as much as possible to get the most use out of the 
method. It is plausible that with more work minimizing the 12-covers, the runtime 
of point-searching can be made more feasible. 

We have checked another “missing” example where 12-descent succeeds with 
more individual care than we were able to take at scale. This was curve E, for 
a = 139413405126996000, which has regulator 1504.24027. With a height bound 
of 107! on the 12-covers, the descent finds a point that gives us a generator of height 
1504.24027 on Eg. It is interesting that this point is not a multiple of any smaller 
generator, suggesting that IL is trivial here. 

Higher descents are also a potential avenue to complete the process of finding 
generators for the remaining curves. The work of Fisher allows one to combine 
covers of coprime degrees subject to a numerical condition on the degrees. This 
includes the case of combining an n-cover with an (nm + 1)-cover to obtain an n(n + 
1)-cover. This could conceivably be used to compute 8 - 9 = 72-covers on Mordell 
curves by combining 8-descent and 9-descent (as a second p-isogeny descent), both 
of which have been implemented in Magma. However, it is unknown at present how 
to make describing and combining such covers practical. 


2.3 Completeness of the Data 


First, for many Mordell curves, we have computed what should be their rank by 
computing the analytic rank. This is easier to compute than the algebraic rank 
in general. According to BSD, these ranks are equal, but this is not known in 
general. Computing the algebraic rank is more computationally intensive and can be 
obstructed by non-trivial I. However the analogous computation was performed 
in [22] for S(5). 
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We have in some cases allowed Magma to assume GRH, which speeds up 
computation of class groups, and hence, descent machinery. This does not invalidate 
searching for points on the corresponding covers. Any rational points found are then 
verified unconditionally to be independent elements of the Mordell—Weil group, but 
when proving that the algebraic ranks agree with the analytic ones, either GRH or a 
longer computation time is required. 

Secondly, and more seriously, we are missing any S-integral points on 612 
Mordell curves E, of rank 1, because so far we were not able to find the generator 
of Mordell—Weil for 306 curves (as once one curve from each isogeny class’s basis 
is found, the other may be computed relatively easily). Assuming BSD, we may 
estimate the regulators of these curves up to a factor of ./|I]]. In the rank 1 case, 
the regulator is simply the height of a Mordell—Weil generator. So in the missing 
cases, either the generators are of large height or ./|IIJ], is large as their product is 
at least 150. 

To relate this to the S-integral points on these curves, we recall that the abc 
conjecture can be used to prove the weak Hall conjecture, which states that integral 
points (x, y) on the Mordell curve E,: y? =x +a satisfy x = O(a?**) for any 
€ > 0, see Schmidt [30]. The same proof can be used to show (asymptotic) upper 
height bounds for S-integral points on E,. These make it seem unlikely that an E, 
of rank 1 with a very large Mordell—Weil generator has an S-integral point. These 
estimates could be made explicit if we assume, for example, Baker’s explicit abc 
conjecture [2]. We give more details on this heuristic in Sect. 2.4. 

These missing Mordell—Weil generators of curves of rank | could be computed 
via the Heegner point method, which is, for example, implemented in Pari/GP [39], 
whose complexity to find P € E,(Q) is proportional to /NA(P). Thus, together 
with BSD, we estimate that we can prove completeness of our database in about 50 
CPU years. This is probably less than the (quote) “many thousand machine hours on 
80 cores” that Bennett, Gherga and Rechnitzer [3] used to recompute the database of 
[22] for (5). The original computation of M(.S(5)) [22] was not timed, but recalling 
from memory, it took approximately one CPU year. 


2.4 An S-Integral Weak Hall Conjecture and the abc 
Conjecture 


In this section, we will discuss an S-integral analogue of the classical Hall 
conjecture and how it adds to our heuristic for why our database should be complete. 
As for the classical Hall conjecture, we will show that it is implied by the abc 
conjecture. 

For this section, we will use the following terminology. For any finite set of 
rational primes S, we call a pair of integers (x, y) S-primitive if there is no p € S 
such that p® divides both x* and y*. We formulate an S-integral generalization of 
the weak Hall conjecture. 
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Conjecture 2.5 (An S-integral weak Hall conjecture) Let S be a finite set of 
rational primes. Let D 4 0 be an integer. For any € > 0, any S-primitive solution 
(x, y) of the equation 

yy =x? +aD, x,yeEZ, aeZNO3, (2.6) 
satisfies 


max(|x|'/?, |y|!/3) = O,((NsD)'**). (2.7) 


Recall that the abc conjecture states that for any ¢ > 0 the following holds. If 
a, b, c are coprime integers with a ++ b+ c = 0, then 


max(|a|, |b], |c]) < Oc (rad(abc)'**), (2.8) 


where rad(abc) = Lo iate 
K, rad(abc)'*®, where K, is a constant that depends only on «. 


p. More explicitly, (2.8) states that max(|a|, |b], |c|) < 


Theorem 2.9 The abc conjecture implies the S-integral weak Hall conjecture. 
More explicitly, if the abc conjecture holds for some 0 < € < 0.1 with constant 
Kg, then any S-primitive solution (x, y) of (2.6) satisfies 


max(\x|’?, (yl?) < Ki py, (2.10) 


Our proof largely follows Schmidt’s proof [30] that abc implies the classical 
weak Hall conjecture, although the proof below avoids some technicalities by 
choosing s and ¢ (see proof) in an efficient way. 


Proof Suppose (x, y) is an S-primitive solution of (2.6). Let g = gcd(x?, y?). Let 
A = x?/g, B = —y?/g and C = aD/g, which are coprime integers. As A + B + 
C = 0, the abc conjecture implies that 


max(|x|3/g, |yl?/g) < Ke rad(ABC)'t*. (2.11) 


We claim that 


NsD 
ad ARC) |. (2.12) 
g 


To see this, we consider two cases. 


Case 1. If some p € S divides ABC, then by S-primitivity of (x, y) we have 
ordy(y) < 2 or ordy(x) < 1. In either case, ord,(g) < 4. If ordy(g) = 4, 
then p*|x, p”| y, p| Ns, and thus p|xyNsD/g. The cases ordp(g) € 
{0, 2, 3} are similar, and ord,(g) = 1 is a priori not possible. 
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Case 2. Suppose some p ¢ S divides ABC. If p { g then the obvious p|xyD 
suffices. If p | g, then ord,(xyD/g) = ordp(g)Q/3 + 1/2+1—1) > 0 
and so p|xyD/g. This finishes the proof of (2.12). 


Plugging (2.12) into (2.11) implies that max(|x|?, |y|?) < Ke(xyNsD)!*® and 
hence 


|xl ly < Ket yNsD)St0C4), 


For s = (1 — €)/(1 — 5e) andt = (1 + €)/(1 — Se) we obtain 


|x| < KUO) (Nepye ait), 


Similarly for s = (1 + ¢)/(. — Se) and t = (2 — €)/(1 — 5e) we obtain 


ly| < K3/0-Se) (ng py G+32)/-5e) 
= & ‘ 
This yields 
max(|x|!/?, |y|!/3) << Rene Dy rete), 


For € < 0.1 this reduces to the claimed bounds. 


Let us relate this to S-integral points on the above Mordell curves Ey: y* = 
x? + a, where a is as in (2.1) an S-unit with bounded exponents. Suppose P = 
(X, Y) € Eq(Os). We can a denominators of X and Y by multiplying X~ 3 and 
ye by suitable powers of p® for each p € S, and call the resulting integers x 
and Y. This yields a relation Y Y? = X3 +@, to which we can apply the S-integral 
weak Hall conjecture 2.5 (with D = 1), or alternatively (2.10) as implied by the 
abc conjecture. We obtain conjectural asymptotic height bounds for |X |> and IY |2, 
which imply up to a small explicit constant (depending on S) the same bound on the 
naive height of P, which in turn is, up to an explicitly bounded error, the Néron—Tate 
height A(P). 

In case S = S(6), we can thus make the following heuristic. First, assume 
that the abc conjecture holds for ¢ = 0.1 with a constant K, < 1.1- 108. We 
checked that this bound indeed holds for all abc-triples of the ABC @ Home project 
by de Smit [14] for which we could compute the radical. Using this e and K, 
and the above reasoning, we would obtain a bound for h(P) of approximately 
2(2 log Ke + 2.2log Ns) < 120. 


3 Attainability of Maximal Conductor by Curves in M(S) 


In this section, we prove some results suggested by empirical observations of our 
data. 
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Specifically we ask the following: Given a set of rational primes S, what is 
the highest possible conductor of an elliptic curve over Q with good reduction 
outside S? An immediate upper bound is Ms := |] pes p/» where fo = 8, f3 =5, 
and fp = 2 for p = 5. More specifically we may then ask: 


Question 3.1 Does there exist a curve of conductor of Ms for any set S? 


The answer to this question is no without further conditions on S. For example 
there does not exist an elliptic curve with good reduction away from 5; however, 
the answer is positive for a large class of S. Motivated by our data, we have the 
following sufficient criterion. 


Theorem 3.2 Let S be a finite set of rational primes that contains either 2 or 3 (or 
both). Then there exists an elliptic curve over Q with conductor N = Ms. 


In order to prove the theorem, we recall the notion of quadratic twists of elliptic 
curves. For any rational elliptic curve E: y* = x* + ax + b and an integer d, we 
denote by E47: y* = x3 + d?ax + d*b its quadratic twist by d. 

The theorem now follows immediately from the following lemma. The proof is 
constructive. 


Lemma 3.3 Let d be a square-free product of primes p > 5. 


1. Let E23): y? = x3 — 18x + 24. Then Ef, 3} has conductor N = 283°d* and 
Kodaira ‘ype Il a 2, at 3, and Ip at p = 5 with p|\d. 

2. Let Ey): y* = = x3 + 8x. Then Ef 12} has conductor N = 28d? and Kodaira type 
III* at 2 and Yj at p = 5 with p|d. 

3. Let E43): y? + y = x3 — 1. Then Efy has conductor N = 3°d? and Kodaira 
type IL at 3 and Ij at p = 5 with p | d. 


Proof This is a straightforward computation with Tate’s algorithm, which we omit 
here. For the convenience of the reader, it is available as an appendix of the GitHub 
and arXiv version of this paper, which can be found at: 
https://github.com/elliptic-curve-data/ec-data-S6/blob/master/docs/paper.pdf 
oO 


We remark that in general, twisting an elliptic curve E by a prime p > 5 may 
change the reduction type of F at 2 and 3, but this does not happen for the three 
curves listed in the lemma. 

Silverman [34, Exercises 4.52, 4.53] gives two families of elliptic curves defined 
over ( that have maximal possible conductor exponent at 3 and at 2, respectively, 
and also have this property after base changing to a number field. The above curve 
E12; belongs to Silverman’s latter family. 
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4 Applications 


In this section we will briefly discuss some applications of the dataset. 


4.1 Solving S-Unit Equations 


Let S be a finite set of rational primes. As above, denote by Os and O% the S- 
integers and S-units, respectively. The S-unit equation is the equation 


xty=l1, x, ye Os. (4.1) 


This classical diophantine equation is intimately related to the abc conjecture; 
this can be seen by clearing denominators to obtain an abc equation. Also, more 
generally, S-unit equations over number fields are known to have only finitely many 
solutions, as was first shown by Siegel [32] and Mahler [27]. Siegel [32, 33] used 
this to prove that any hyperelliptic curve of genus at least one has only finitely many 
S-integral points. 

It turns out that solving S-unit equations can be reduced to the computation 
of M(S U {2}) via Frey—Hellegouarch curves: If (x, y) is a solution of the S-unit 
equation, then EF, : Y? = X(X — 1)(X — x) lies in M(S U {2}). Moreover any curve 
E € M(S U {2}) can be obtained in this way from at most six different solutions 
of (4.1), and these can be computed from the six possible modular 4-invariants 
of E. In our case, (4.1) for S = S(6) is exactly the case that was considered by 
de Weger [15]. He proved that, up to symmetry, it has exactly 545 solutions. We 
checked that the curves associated to all of these can be found in our database, which 
means that our database certainly contains all Frey—-Hellegouarch curves with good 
reduction outside S(6). We remark that (4.1) has been solved for S = S(16), as well 
as for all S with Ns < 107 [22]. This is far out of reach for the above method of 
reducing (4.1) to computing M(S). 

In the other direction, the computation of M(S) can be reduced to solving S’-unit 
equations over finitely many number fields, where the number fields are all possible 
number fields K of degree at most six that are unramified outside S U {2} and S’ 
being the primes in K above S U {2}. This link was made into an algorithm by 
Koutsianas [25]. 


4.2 Other Diophantine Problems 


Many other diophantine problems reduce to the computation of M(S), notably cubic 
Thue—Mahler equations 
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ax? + bx’y +cxy?+dy3 =m Il Dp’, x,yEZ, ep €Z>0 (pES), 
pes 


where a,b,c,d,m € Zandm 0 are given such that the left-hand side has non- 
vanishing discriminant. Likewise generalized Ramanujan—Nagell equations 


x +b=y, x€Os, yEOS, 


where b # 0 is a given integer, can be reduced to computing M(S). In particular, we 
can find solutions for these equations for S = S(6) via our computation of curves 
in M(S), which subject to the hypothesis that we have in fact found the whole set 
M(S) should be the complete sets of solutions of these equations; see the above 
discussion on completeness in Sect. 2.3. 


4.3 n-Congruences Between Elliptic Curves 


Given n € N, a pair of elliptic curves E,, E2/Q for which E,[n] ~ E2[n] as 
Galois modules are called n-congruent. The Frey—Mazur conjecture implies that 
there should be an absolute bound C such that if p > C and E), E2 are p-congruent, 
then EZ; and £2 must be isogenous. The only known example of a pair of non- 
isogenous 17-congruent elliptic curves, found by Cremona and then Billerey [5], 
occurs for a pair of curves with good reduction outside of 3,5, 7, 13. Using our 
database we searched for similar examples of n-congruences between curves for 
primes 13 < n < 47. We found several instances of 13-congruences that were 
outside the range of existing databases. Fisher has recently found an infinite family 
of 13-congruent curves [18], of which the examples in our database are all members. 
We did not find any further examples of 17 (or higher) congruences between curves 
in our database, other than quadratic twists of the example of Cremona-—Billerey 
mentioned above. 
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Efficient Computation of BSD Invariants M®) 
in Genus 2 ey 


Raymond van Bommel 


Abstract Recently, all Birch and Swinnerton-Dyer invariants, except for the order 
of I, have been computed for all curves of genus 2 contained in the L-functions and 
Modular Forms Database [LMFDB]. This report explains the improvements made 
to the implementation of the algorithm described in [vBom19] that were needed 
to do the computation of the Tamagawa numbers and the real period in reasonable 
time. We also explain some of the more technical details of the algorithm, and give 
a brief overview of the methods used to compute the special value of the L-function 
and the regulator. 


Mathematics Subject Classification (2010) 11G40, 11G10, 11G30, 14H40 


1 Introduction 


The Birch and Swinnerton-Dyer conjecture has an extensive computational history. 
The conjecture has originally been conceived by Birch and Swinnerton-Dyer based 
on computational findings on elliptic curves. Later, it has been generalised by Tate 
to the case of abelian varieties over general number fields. We formulate the version 
for principally polarised abelian varieties over Q here. 


Conjecture 1 (BSD for Abelian Varieties over Q, [BiSw65, Tate66]) Let A/Q 
be a principally polarised abelian variety of dimension d and algebraic rank r. Let 
L(A, s) be its L-function, R its regulator, UW its Tate-Shafarevich group and QQ its 
real period. For each prime number p, let cy be the Tamagawa number of A at p. 
Then L(A, s) has a zero of orderr at s = 1 and 
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R-Q- {U0 -T],¢p 
|A(Q)tors|? 


lim (s — 1)" L(A, s) = 
sl 


Since the group IL is very hard to compute in general, it has been common to verify 
the conjecture “up to squares”. For this, one computes the other terms numerically, 
and checks that the value of |II| conjectured by the formula is close to a square 
or twice a square, according to what the criterion of Poonen and Stoll, [PoSt99], 
predicts. 

For example, this has been done in the paper [FLSSSW] by Flynn, Leprévost, 
Schaefer, Stein, Stoll and Wetherell for a collection of 32 modular Jacobians of 
hyperelliptic curves of genus 2 with small conductor. Modularity was needed for 
the computation of the special value of the L-function. In [vBom19], the author 
developed and implemented algorithms in Magma to compute Tamagawa numbers 
and real periods for hyperelliptic curves of genus 2. The algorithms only use the 
equation of the curve and build upon existing methods implemented by Steve 
Donnelly to compute regular models. 

The current goal is to compute all these BSD-invariants for the curves of genus 
2 in the [LMFDB], see also [BSSVY], and the curves of higher genus that will 
appear in the future in the [LMFDB], see also [Suth19]. The Tamagawa numbers 
had already been computed for all but 54 of the genus 2 curves in the database. For 
the other 54 curves, and for most of the real periods, the algorithms developed in 
[vBom19] did not give a result within a reasonable amount of time. 

Given that there are more than 66,000 curves of genus 2 in the [LMFDB], the 
computation of a Tamagawa number or real period should ideally take at most a few 
seconds for most of the curves, and only more than an hour in very exceptional 
cases. In this report, we will explain the improvements that have been made to 
achieve this, and therefore finish the verification of BSD up to squares for all curves 
of genus 2 in the [LMFDB]. We also report on the computation of the real periods for 
both hyperelliptic and non-hyperelliptic curves of genus 3 from [Suth19]. Moreover, 
this report will also serve as a more extensive documentation of the algorithms, 
explaining more details of the computation than the description in [vBom19]. 


Notation Throughout this text, let C over Q be a smooth projective curve, let p be 
a prime number and let C / Zp) be a regular model of C over Zp). Let J be the 
Jacobian of C. 


2 Tamagawa Numbers 


Classically, Tamagawa numbers are invariants which are associated to the so called 
Tamagawa measure on an algebraic group. However, in the context of the Birch and 
Swinnerton-Dyer conjecture, Tamagawa numbers are defined a bit differently. To 
find more about the relation between the classical definition and the one used in the 
Birch and Swinnerton-Dyer conjecture, see [Blo80] for example. We will use the 
following definition. 
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Definition 2 Let A be an abelian variety over Q. Let p be a prime, and let A/Z,p) 
be a Néron model of A/Q. Let g = Ar, /AR, be the component group scheme of 
the special fibre of A. Then the Tamagawa number of A at p is defined as cp(A) := 


Ip(Fp)I- 


In practice, if we have a curve of higher genus, it is already infeasible to compute 
equations for its Jacobian in projective space. The computation of a Néron model is 
certainly out of scope. The following theorem by Raynaud gives a relation between 
the component group scheme and a regular model of the curve. 


Theorem 3 ([BoLi99, Theorem 1.1, p. 3]) Let C, and J be as usual. Let chens 
be a regular model of C over the strict henselisation of Zp), and let I be the set of 


components in the special fibre of chens rer 


¢ @ be the linear map Z' — Z! given by 


re YO) Pr, forte, 
Yel 


where (-, -) is the intersection pairing on chens 
* B be the linear map Z! - Z given by 


I+ multiplicity(L), forl eT, 


¢ @y be the component group of the special fibre of a Néron model of J. 
Then Gal (Fp/Fp) acts in a natural way on I and Z!', and hence onim@ and ker B, 


and there is a canonical exact sequence of Gal (Fp/ F 2) -modules 


0—> ima > kerB > vs( Fp) > 0. 


Briefly summarised, our algorithm to compute the Tamagawa numbers (see also 
[vBom19]) consists of the following steps: 


1. Compute a regular model C for C. 

2. Find the set J of components of the special fibre of Ch°". 
3. Compute the action of Frobenius on /. 
4 
5 


. Use Theorem 3 to construct g; (Fp) as a Gal (Fp/ F r) -module. 
. The Tamagawa number cy is then the cardinality of 


01 (Fp) = 9 (Fp) Fe/®), 
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For step 1, we use the method RegularMode1 in Magma, which gives us patching 
data for a regular model. In Sect. 2.1, we explain in more detail how this model is 
computed and represented. 

The rest of this section will be focused on step 3, the computation of the action of 
Frobenius on the components. Section 2.2 contains an outline of the computation of 
this action. The further subsections explain some problems we encountered during 
this computation and how we solved these. 


2.1 Computation of a Regular Model in Magma 


The existence and construction of a regular model for C relies on the resolution of 
singularities for arithmetic surfaces. The following result by Lipman implies that 
such resolutions exist. 


Theorem 4 ([Lip78, Art86]) Suppose A/Z(p) is a normal proper flat model of 
C/Q i.e. A is an arithmetic surface over Z(p) and its generic fibre is isomorphic to 
C. Define a sequence 


A=Ap<- Ar <- Ar<... 


of arithmetic surfaces as follows: the morphism Ay < An4+1 is the normalisation 
of the blow-up of An in the non-regular points of An. Then for sufficiently large n, 
the arithmetic surface A, is regular. 


In other words, we can obtain a regular model from any flat model by repeatedly 
normalising it and blowing up the non-regular points. This is not exactly the way 
in which Magma computes a regular model, but it is close. What the algorithm in 
Magma actually does is to construct a sequence of blow-ups 


Co -—Ci — Oo <—...<—Cy =C, (1) 


where Co is some proper flat model of C we start with, C is a regular model we end 
with, and each morphism C,, < C,+, is a blow-up in either a point or irreducible 
component of the special fibre of C,,. If there is a choice between blowing up a point 
or a component, components will be blown up first (although the latter has only 
been implemented for curves over Q). 

Let us briefly describe the data involved in this production of a regular model by 
Magma. In general, the input of the algorithm is a smooth projective curve defined 
over a number field K and a prime ideal p of Ox, the prime at which the model must 
become regular. The regular model is stored as a collection of affine patches and 
transition maps between some of these patches. Each time a blow-up is done inside 
a certain patch, some new patches with transition maps to and from the original 
patch are created. Other relevant data that are stored are sets of non-regular points, 
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points where components in the special fibre of the model intersect, multiplicities 
of these components, and intersection numbers. 

In some cases, because the non-regular points in the special fibre are blown up 
one at a time, it could happen that such a non-regular point is not defined over Ox /p. 
In this case, the number field K is extended and the algorithm is restarted with this 
new number field as base field. Also in the end, when we want to compute the set 
I of geometric components of the regular model (i.e. the set of components in the 
special fibre of Ch€"S), the number field may need to be extended. 

The reader might wonder at this point why we chose to work over number fields 
instead of over local fields. Indeed, the theory is cleaner to state in the local setting, 
but the problem is that some of the computational tools that we need have not been 
implemented for local fields. For example, multivariate polynomial factorisation has 
only been implemented for finite fields, number fields, function fields of curves over 
such fields, Z, and some finitely generated extensions of these rings. 


2.2 Outline of Algorithm to Compute the Galois Action 


Even though our curve C is defined over Q, we have seen in the previous subsection 
that it might be necessary to work over a general number field K. We will go through 
the algorithm that Magma uses to create its regular models, and see how we can track 
the action of Frobenius on the special fibre during this process. Recall the notation 
from Eq. (1). 

At the start, we have a proper flat model Cg all of whose equations are defined 
using coefficients in Zp). First we compute the irreducible components occurring 
in the special fibre of Co and the action of Frobenius on them. This can simply be 
done by applying Frobenius on the ideal defining such a component in one of the 
affine patches containing the component. Then we proceed to look at the non-regular 
points. 

Suppose P is a non-regular point in the special fibre of Cg. Then all Galois 
conjugates of P are also non-regular in Co. Suppose P is defined over F,,« for some 
€ > 1. Then the special fibre of the blow up of Co in P can be defined over Fc. 

Let Comp p be the set of components in the special fibre of C that map to P € Co, 
and define Compg analogously for any Galois conjugate Q of P. Then Frobenius 
sends components in Comp p to components in Compr, op, p)- Of course we have the 
following properties. 


Proposition 5 


(a) The bijection Compp — Comp pop p) induced by Frobenius preserves inter- 
section numbers. 

(b) If Q # P is a Galois conjugate of P, then components in Comp p and Compa 
do not intersect. 
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(c) For any component D of the special fibre of C and any component E in 
Compprop(p) We have 


(D, E) = (Frob~!(D), Frob~!(E)). 


As a consequence, intersection numbers of components in Compp,op(p) With any 
other component of the special fibre of C can be reduced to an intersection number of 
a component in Comp p with either another component of Comp p, or a component 
not lying in any of the Compg, with Q ranging over the Galois conjugates of P. 

In particular, it is not of importance for us to compute which component 
of Compp is exactly mapped to which component of Compr, pp). Instead, our 
algorithm just creates £ copies of Comp p, and lets Frobenius map a component to 
the corresponding component in the next copy, for the first 2 — 1 copies. For the 
£-th copy the action of Frobenius is computed differently. For this, we compute the 
action of Frob’ on Comp p ina recursive way. 

The algorithm backtracks the construction of the regular model, while keeping 
track of all the intersection numbers and the action of Frobenius. To summarise our 
recursive algorithm: 


Algorithm 6 Input: a non-regular point P of Cn. Suppose k(P) = F pe. 

Output: combinatorial description of Comp p, the set of components in the special 
fibre of C that map to P.. This combinatorial description includes the action of Frob‘ 
and the intersection numbers. 


Step 1 Find equations for the components in the special fibre of C,+; which are 
contracted to P, and compute the action of Frob’ on these components, 
and the intersections of these components with each other. 

Step 2 Loop through all Galois orbits of non-regular points in C,+1 mapping to 
P. Execute steps 3 to 7 for each orbit. 


Notation for steps 3 to 7: Q is a point of such an orbit and k(Q) = F jem. 


Step3  Recursively, call Algorithm 6 with the non-regular point Q of Cy41 as 
input. In this way, we compute the action of Frob“” on Comp g> and the 
intersections of these components with each other. 

Step 4 Compute the intersection of the components considered in step 1 with the 
components in Compg. (This is already done by the RegularModel 
command in Magma.) 

Step5 Create m — | more copies of Compg for Compr, ,p¢ (Q): Comp pr op2¢ g) et 


cetera. 
Step 6 The action of Frob° is the identity Comp pop! 0) > Comp py op!-+1) Q) 
fori = 0,...,m — 2. For i = m — 1, the action is given by the action 


computed in step 3. 

Step 7 The intersection of components contained in Comp prop" 0) and compo- 
nents contained in Compr, .p¢- i(Q) is O for distinct 7, 7 € {0,...,m— 1}. 
The intersection number of a component D € Compp, 4°, Q) fori = 
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1l,...,m — 1 witha component E considered in step 1, equals the 
intersection number of Frob~'“(D) and Frob~'"(E). 
Step 8 Combine and output all the combinatorial data that has been collected. 


The improvement in comparison with the previous implementation is the more 
systematic implementation of Steps 5, 6, and 7, which led to a significant speed- 


up. 


2.3 Problem Arising During the Algorithm 


One problem that arose regularly in our algorithm is the following. When we blow- 
up our point P with coordinates in F p> the special fibre of the new patches are not 
necessarily defined over F pé even though they could have been. This is due to the 
choice we have when parametrising the blow-up and is illustrated in the following 
example. 


Example 7 Suppose K = Q[a]/(a* + a+ 1) and p = 20x. Consider the affine 
curve given by y? = 2(x? + x + 1) in A? over Ox,y. It has two Galois conjugate 
non-regular points (a>, 0) and (a!°, 0) in the special fibre, with coordinates in F4. 
Let us do a blow-up in (a>, 0). We rewrite the equation as 


y= x? 4+ Ae +2) + Ga = 40), 


where x’ = x — a>. To get one of the charts of the blow-up, we ‘substitute’ y = tx’ 
and 2 = sx’. We get the equation 


tx! = 7x3 4 (2a + L)sx’” + (a? _ a)s?x'? 
which after dividing by x’? gives the following equations for the blow-up: 
t= s7x! + (2a> +1)s+ (a? _ a)s*, sx’ = 2. 


Now we see that the special fibre is not defined over F4, as a—a ¢ Fa. 


As this problem occurred quite often, we tried to look for a solution that is efficient 
to implement. We are aware of two suitable possible solutions: 


1. Blow-up Galois conjugated points at the same time (e.g. (a>, 0) and (a!°, 0) 
in the previous example). This ensures all schemes stay defined over Zip). 
We choose not to take this route as the blow-up becomes significantly more 
complicated to represent when blowing up multiple points at the same time. 

2. For any number field K that we encounter in our algorithm (e.g. when we start 
with a curve over Q, but we need to extend the field we are working over several 
times in order to blow up non-regular points, as described in Sect. 2.1), make sure 
that for each d dividing [K : Q] there is a subfield Kg C K of absolute degree 
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d. In this way, if a non-regular point in the special fibre is defined over F a, all 
equations for the blow-up can be taken to lie inside Kg and the special fibre of 
the blow-up is guaranteed to be defined over F pt: 


In the next section, we explain what should be kept in mind when constructing 
number fields as in 2., and how it has been implemented in our algorithm. 


2.4 Construction of a Suitable Extension 


For this subsection we say that a number field K has the subfield property, if for 
every divisor d of [K : Q] there is a subfield of K of absolute degree d. Our goal 
is now to construct number fields having the subfield property, as described in the 
previous subsection. Let us first prove that they exist, and that we can extend them. 


Proposition 8 Let £ be a positive integer and let p be a prime. Let K be a number 
field having the subfield property such that p is inert in K. Then there exists an 
extension K C L of degree & such that p is inert in L, and L has the subfield 
property. 


Proof It suffices to consider the case @ is prime. Write [K : Q] = 0° -m, where m 
is relatively prime to €. Let Kee be a subfield of K of absolute degree ¢°, and let Kin 
be a subfield of absolute degree m. Then construct any extension Kee C Lye+1 such 
that p is inert in L,-+1, for example by constructing an equation for the extension of 
residue fields F pe F eet! and lifting it to Kee. Define L as a compositum of the 
linearly disjoint fields K,, and Lye+1. Then p is inert in L. Now we show that L has 
the subfield property. 

If d is a divisor of [L : Q] = £°+! - m, then either d divides [K : Ql], or it is of 
the form ¢°+! . m’ for a certain divisor m’ of m. In the former case, K already has 
a subfield of degree d. In the latter case, K has a subfield K,, of degree m’ and the 
subfield Lye+1 Kj," of L has degree d. 


There are of course many ways to construct fields with the subfield property, but 
one thing to keep in mind is that we would like to avoid the situation where the 
coefficients for equations for the number field get too large to do any meaningful 
computation with them. 

Our first attempt was to only consider abelian extensions, i.e. subextensions of 
cyclotomic fields. They typically have very small defining equations. However, the 
problem is that we could not answer the following question affirmatively. 


Question 9 Is Proposition 8 still true if K and L are required to be abelian? 


So we decided to construct the fields following the strategy of the proof of 8. 
To keep the defining equations for our number fields small, we apply a reduction 
algorithm every step. This has been implemented in Magma under the name 
OptimisedRepresentation. For this function, it is important that the ring 
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of integers of K is computable, which is a property we would like to have anyway 
for the computation of a regular model. A big bottleneck in the computation of the 
ring of integers is the factorisation of the discriminant of K, therefore we would like 
to keep this small. 

Hence, we use the following strategy for each extension K C L we construct. We 
first fix the extension modulo p, as we want p to remain inert in L, i.e. we extend 
K taking the root of some yet to be determined polynomial f € K[x] and we fix f 
modulo p. We then try to find a local minimum for the discriminant of f. Ie., we 
start with any such /, and we repeatedly add or subtract p times a randomly chosen 
monomial. If the discriminant of f gets larger, then we revert the last change. 

In practice, this worked for all curves of genus 2 contained in [LMFDB]. We 
identified two ways in which this algorithm could still be further improved. First, 
there might be better ways to find ‘small’ field extensions. Second, it might be 
worthwhile to determine in advance for which d the subfield Kg of absolute 
degree d is actually needed. Then there would be fewer constraints and hence more 
candidates for the number field K. 


2.5 Implementation 


The algorithm has been implemented by the author in Magma, changing partly the 
way regular models are constructed. The computation of the Galois action on the 
component group took a negligible amount of time after a suitable regular model 
was constructed. Most time was spent on the construction of a regular model. 

For the 66,158 curves of genus 2 in the [LMFDB], the construction of suitable 
regular models for all of the primes of bad reduction, took about 2.02 s on average 
per curve. For 36 of these curves, the computation took longer than 60s, the longest 
one taking about 1600s. 

In principle, the implementation can also be used to compute Tamagawa numbers 
for the curves of genus 3 from [Suth19]. Currently, this has not been done, and it is 
not inconceivable that there could be a few tricky cases for which the computation of 
the ring of integers of a suitable field extension, as described in Sect. 2.4, becomes 
a bottleneck. 


3 Real Periods 


Let A be an abelian variety over Q of dimension g. The real period has been 
defined in different ways in the past. Traditionally, this has been defined in terms 
of an integral of a g-form, the so-called Néron differential, along A(R). It can also 
be defined in terms of integrals of 1-forms along homology cycles of A. For the 
comparison of the different definitions see for example [Gro82]. 
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Definition 10 Let m be the number of connected components of A(R). Let A 
be a Néron model of A over Z, and let (y1,..., 7g) be a basis for the group 
H,(A(C), Z)@"C/®) of homology cycles invariant under complex conjugation. Let 
(@1,..., @g) be a Z-basis of Wz). Then the real period of A is defined as 


& 
det (/ 07) A 


In the case of a Jacobian with Néron model 7, the Abel-Jacobi map gives a bijection 
between Q) at ) and the global sections we/z (C) of the canonical sheaf of a 
regular model, see for example [vBom19, Sect. 3.2] and [Mil86, Sect. 2]. As a 
consequence, we get the following result, which gives us a practical way to compute 
the real period of the Jacobian of a curve. 


Q4=m- 


Proposition 11 Let C, J andC be as usual. Then 


& 
det (/ 07) 
vi ij=l 


where (y1,..., ¥g) is a Z-basis of H\(C(C), Z)CC/R) | and (w1,..., @g) is a Z- 
basis of wc;z(C), and m is the number of connected components of J(R). 


Qy=m- 


’ 


An algorithm to compute real periods is described in [vBom19, Algorithm 13]. We 
repeat it here. 


Algorithm 12 Input: a curve C of genus g over Q. 
Output: the real period (2; of its Jacobian J. 


Step 1 Compute the big period matrix (/ - oy i Here w = (@,..., Mg) 
is any basis of Q6ig(©) and (y1,..., 72g) is a symplectic basis of 
A, (C(C), Z). 
Step 2 For each subset J C {1,...,2g} with |/| = g, calculate the covolume 
——— \ j=l.,....g 
ele) | 
I € f,@i + Sy, i icl 


Step 3 Compute a generator P for the lattice inside R spanned by the P,. 

Step 4 For each bad prime p, construct a regular model C? /Z(p) of C. 

Step 5 For each bad prime p and each of the differentials w,, ..., @g, check if the 
differential has a pole on any of the irreducible components of the special 
fibre of C?. If so, adjust the basis by multiplying the differential having a 
pole with p to get a new basis w’ and apply Step 5 again (until the basis is 
not changing anymore). 

Step 6 For each bad prime p and each (ci € {0,..., p—1}®\{(0, 0, ..., OJ}, 
check if Dy cj@j; vanishes on the whole special fibre of C”. If so, adjust 
the basis w’ by replacing one of the w; such that c; ¢ 0 with ; Vj cie;, 
then apply Step 6 again (until the basis is not changing anymore). 
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Step 7 For each bad prime p compute p“~?, where a is the number of basis 


adjustments done in Step 5, and b is the number of basis adjustments done 
in Step 6 (this is also the determinant of the change of basis matrix whose 
columns express w’ in terms of w). Then take the product W over p of 
these determinants, and output W - P. 


In order to compute the real periods for the 66, 158 hyperelliptic curves of genus 2 
contained in [LMFDB] some optimisations had to be done: 


1. In our original implementation, many computations appeared to be done multiple 
times. To avoid this, new data structures have been constructed to store all rings, 
ideals, their Grobner bases and other relevant objects that are needed multiple 
times during the computation. 

2. In our original implementation, we used Van Wamelen’s algorithm [vWam06] 
implemented in Magma to compute the big period matrix in Step 1. This only 
works for hyperelliptic curves with a simplified WeierstraB model of odd degree. 
We moved to Pascal Molin and Christian Neurohr’s implementation that can 
compute the Riemann surface associated to any curve [MN19, Neu18]. This also 
had advantages for the hyperelliptic case as it allowed us to take simpler (e.g. 
even degree) models, which typically give rise to simpler regular models. 

3. In Step 6 of the algorithm, for p* — 1 differentials it is tested whether or not they 
vanish on the special fibre. This should not be necessary and in principle be a 
linear algebra problem. This is explained in more detail in Sect. 3.3. 

4. The Grébner basis computations over Z took most of the time in this algorithm. 
In some cases, even the computation of the function field over Q of one of the 
patches of the regular model was taking a lot of time. In Sect. 3.2, it is explained 
how we circumvented Grobner basis computations over Z. 


Moreover, we will consider some other ideas that could be used to improve this code 
even further in the future. 


3.1 Computation of the Order of Vanishing of a Function 


First, we will briefly explain how the order of vanishing of a function is computed. 
We reduced our computation to one inside a polynomial ring over Z. In the following 
example, we illustrate how this is done. 


Example 13 Consider the scheme given by the equation y* = x? — 2x — 2 in AZ. 
We take p = 2. We want to compute the order of vanishing of the function 2 on 
the component given by the vanishing of x + y and 2. In other words, we want to 
compute the multiplicity of that component in the special fibre. 

We start with the ideal J = (x + y, 2) C Z[x, y] and notice that 2 lies in it, so 
the order of vanishing is at least 1. Then we compute 


b=P +0? -2x74+2x42 = (7? + y? +2, 2x42, 2y+2, 4. 
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Even though 2 does not lie in J, we see that the ideal quotient 
(h:Q)=@41, y4+1, DEI. 


In other words: 2 when multiplied with some unit lands in Jy, and 2 vanishes with 
order at least 2. We proceed to compute 


hea G4 4 2F4 9) 


= (x7 + 2x + 3y7 +6, 2xy + 2x + 2y? +2y, 4x +4, 4y 44, 8) 
and 
(a O)\ SG? 4 ey ay ey Oe 2 yO. A), 


which is easily seen to be contained in J. Hence, the vanishing order of the function 
2 on the component given by the vanishing of x + y and 2, equals 2. 


So in order to compute the order of vanishing, we make use of the following 
proposition. 


Proposition 14 Let R be a commutative ring. Let J be an ideal of R, and let I 
be a prime ideal of R containing J. Let n be an integer. Then an element f € R 
maps to an element in I" - (R/J)1 C (R/J)1, i.e. f vanishes with order at least 
n on the component defined by I in Spec(R/J), if and only if the ideal quotient 
(I” + J : (f)) is not contained in I. 


Proof Suppose f € R/J lands in I” - (R/J); after localisation. This means that 
there are g € J” andh ¢ I, such that (f — g)h = 0 € R/J, or in other words 
(f — g)h € J. From this, we deduce that h € (1” + J : (f)), as desired. 

Conversely, if h ¢€ 1” + J : (f)) is an element such that h ¢ J, then we can 
write hf as j + g for some j € J and g € J”. Then we geth f = @ € I" - (R/J)7. 
As h is a unit inside (R/J); we get that f € I” - (R/J) , as desired. 


This leads to the following algorithm. 


Algorithm 15 Input: an f € Z[x,..., Xm] and ideals I and J of Z[x,...,Xm] 
such that V(J) is a flat curve over Z on which f does not vanish. We assume I is 
a prime ideal containing J, defining an irreducible component in the special fibre 
V(J)p of this curve. We assume that the local ring Oy(j),1 is regular. 

Output: the order of vanishing of f at V(Z). 


Step 1 If f is not irreducible, find a factorisation for f and run the rest of the 
algorithm for each of its irreducible factors. 

Step 2 Start with n = | and Jp = J. 

Step 3 Compute a Grébner basis for the ideal J, = [,_-;-I1+J of Z[x1,..., Xm]. 

Step 4 Compute generators for Uy : (f)). 

Step 5 If any of these generators is not contained in /, then return the value n — 1. 
Else, increase n by | and proceed with Step 3. 
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Proposition 16 Algorithm 15 terminates and is correct. 


Proof By induction we see that J, = I” + J. The correctness follows immediately 
from Proposition 14. As f does not vanish on V(J), the element f reduces to a 
non-zero element of the discrete valuation ring Oy,,),;. In particular, the order of 
vanishing of f at V(/) is finite, and the algorithm will terminate. 


The ideal quotient and membership of ideals can be checked using Grobner basis 
machinery. This has been implemented in Magma and other packages in the case R 
is a polynomial ring over a field or a polynomial ring over Z. 

In case we are working over a ring of integers Ox which is not Z, we can 
represent Ox as a Z-algebra of finite type and still use the Grobner basis machinery 
over Z. For the computation of the real period, the field extensions that are necessary 
when computing a regular model, as explained in Sect.2.1, are not chosen as in 
Sect. 2.4, but they are chosen in such a way that Ox is monogenic. In this way, we 
can proceed with the Grébner basis calculations at the cost of adding just one extra 
variable. This is how we get an algorithm to compute the order of vanishing of a 
polynomial. 

In general, a function will be of the form Ll where f and g can be written as 
polynomials with integer coefficients. The order is then computed as the difference 
between the order of vanishing of f and g. 

The reader might be wondering at this point why we are working over Z instead 
of over the ring of integers of a local field. The main reason is the availability of an 
algorithm to factor multivariate polynomials over Z in Magma. If f and g are two 
polynomials, then the computation of the order of vanishing of f and g generally 
takes way less time than the computation of the order of vanishing of f - g (without 
using the factorisation). 


3.2 How to Avoid Difficult Computations in Characteristic 0 
3.2.1 Working over Z/p”Z Instead of over Z 


In Example 13, all the calculations could have been done over Z/4Z instead of Z, as 
we already know in advance that 4 has a strictly higher order of vanishing than 2. In 
fact, we replaced the current algorithm to compute the multiplicity of a component 
with this new algorithm working over Z/p?Z, leading to big speed-ups in some 
difficult cases. 

In general, when we consider a polynomial f and we want to determine whether 
f vanishes with order at least r in a component with multiplicity m, it suffices to 
do all computations over Z/p!"/"!+!Z. In this way, we can avoid Grobner basis 
computations over Z, which take significantly more time. 
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Proposition 17 Let f,r,m be as above. If Algorithm 15 is run over Z/p\/"!+'Z, 
instead of Z and outputs a value less than r, then the output is correct. If the output 
is greater than or equal to r, then f vanishes with order at least r. 


Proof Instead of testing whether f lies in J” - (R/J)7, see also Proposition 14, 
we are testing whether f + p\"/”J+! . g lies in J” - (R/J) 7, for some element 
g € Z[x,...,Xm]. Because p!’/"J+! vanishes with order m- ([r/m| +1) > r, the 
strong triangle inequality yields that f € I”-(R/J), if and only if f+ pl"/"!+!.¢ € 
I” .(R/J)7 incase n < r. This proves the proposition. 


3.2.2 Optimising the Ideals for Fast Computation 


Another big improvement came from a slight modification of the ideals J,, as 
described in Example 13. A priori we were computing /, as J, = I” + J, where J 
is an ideal inside a polynomial ring Z[x,, ..., xg] corresponding to an affine patch 
of C, and then computing a Grobner basis. In our new implementation we defined 
ideals pnedined and J, as follows: 


eo I ifn =1, 
7 I roe -IT+J_ else, 


modified _ I ifn =1, 
2 Jn + (x € GBI Modified) - (J, : (x)) ZI) else, 


where GB gives a Grébner basis of an ideal. Basically, the elements that we added 
to pound are exactly those generators of J nn that already appear to vanish 
with order n at J. The following proposition will formalise this and prove that we 
still get the correct answer if use the ideal J™°4!"e¢ instead of In. 


Proposition 18 For any positive integer n, there is an equality 
Un/ Dr = Gre" I) 1 


of ideals of (Z[x1,...,xe]/J)1. 


Proof We prove the statement by induction to n. The case n = 1 is trivial. 

Let k > 2 be an integer and suppose the statement is true for all n < k. We 
start with the equality (j-1/J); = GE ae /J)1,. Multiplying both sides with the 
ideal (J/J)7, gives us that (k/J)7 = (Jk/J)1. Now we will prove the equality 
(Je /J)1 = UPetited 7) 7, which will finish the proof of the proposition. 

Let x € Z[x,...,x¢] by any element with (J, : (x)) ¢ J. This means that 
x multiplied with a unit in R; lies inside Jg. In other words, x lies in (Jx/J)7. 
Therefore, the equality (J; /J); = U on /J)1 follows by definition of / ae, 
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Corollary 19 Algorithm 15 is still correct if In is replaced by [Posified, 


One might wonder why we chose to use this rather artificial looking ideal [motified 
instead of J,. The following example, in which we also use our improvements from 
the previous Sect. 3.2.1, illustrates the advantage of this approach. 


Example 20 Consider the ideal 


J= a agi Oe gee) 
inside the ring sigh. y, z] equipped with the graded reverse lexicographic order. 
Now we look at the component in the special fibre given by J = (x + y, z, 2). Then 
the direct computation of a Grébner basis for [;g took 43.5, while the inductive 
computation of a Grobner basis for [ a only took 2.32 s. Moreover, the former 
Grébner basis has 61 elements, while the latter only has 30 elements. 


3.2.3 Avoiding Function Field Computations 


In our previous implementation, we computed a function field for each patch of the 
regular model. This function field was used to represent differentials on this patch. 
The computation of this function field is quite expensive. In the following example, 
it did not seem to finish within reasonable time. 


Example 21 Consider the curve 1328.a.84992.1, given by 
y+ (x + Dy = 4x5 + 9x4 + 16x3 + 13x? + 8x +1. 


The regular model that Magma computes at 2 for this curve, has a patch given 
by two equations. One of these equations has 428 terms and is of degree 56. The 
computation of a function field over Q for this patch did not finish in 24h. Although 
we could not determine why the computation took so long, we suspect that it is due 
to the exponential time needed to compute a Grébner basis for an ideal. 


However, in practice, we do not need this function field. Except for the first 
patches, every patch P is defined using three variables x, y, and z, and two equations 
f = g =0. Without loss of generality, we can assume that all differentials on P can 
be represented as h - dx with h € K(x, y, z), where K is the base field (this does 
not need to be Q, as sometimes there is need extend the base field, as explained in 
Sect. 2.1). The only operation for which we used the function field, was to convert a 
differential of the shape j - dy or j - dz into a differential of the shape h - dx. The 
following algorithm gives us a way to achieve this in polynomial time. 


Algorithm 22 Input: polynomials f, g € K[x, y, z] defining the affine patch P. 
Output: relatively prime a, b € K[x, y, z] such that a. dx = b- dy holds onP. 


Step 1 The equations df = dg = 0, give rise to the following linear relations 
over K[x, y, z] for dx, dy and dz: 
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af af af dx 

ax dy 9 

(E ae i) dy] =0 (2) 
Ox dy Oz dz 


Step 2 Consider 


Multiply Eq. (2) on the left side by the adjugate of M to get a relation of 


the form 
FDO 7 ; re 
GOD) \) 

z 


for some polynomials D, F and G in K[x, y, z]. 


Step 3 Then we compute a = and b = 


F 
gcd(D, F) gcd(D,F)* 


Proposition 23 Algorithm 22 is correct and the number of field operations in K is 
polynomial in the input size. 


Proof It is clear that equality (3) always holds. To check that Step 3 is well-defined, 
we will prove that D #4 0. Suppose D = 0. Because dx is a non-vanishing 
differential by assumption, we get F = G = 0. The Jacobian matrix 


af af af 
ox Oy az 
atts) 


has full rank, as f = g = O defines a curve in A>. Hence adj(M)N = 0 implies that 
adj(M) = 0 and M = 0. But that immediately contradicts the fact that N is of full 
rank. 

Now we will look at the number of field operations, let B be a bound for the 
number of coefficients of f and g. To compute the six partial derivatives, we need 
at most O(B) field operations. For the matrix multiplication in Step 3, we need 
to multiply polynomials with at most B coefficients with each other, so this can be 
done using at most O(B*) field operations. To see that the number of field operations 
needed in Step 3 is polynomial, we refer the reader to [Witt04, chap. 6]. 
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3.3 How to Avoid Checking p® Differentials 


In Step 6 of Algorithm 12, a candidate basis w),...,@g for the global sections of 
the canonical sheaf we z,,. is considered. For ae linear combination DF Cj; 
with the c; € {0,..., p — 1} not all ae to zero, it is tested whether s CjOj 


vanishes on the special fibre. If so, then 4 ; a j ¢j@j can replace one of the current 
candidate basis elements. 

For big values of p, the checking of p* — 1 differentials can take a lot of time. 
To check whether a differential vanishes on all components of the special fibre of 
C, quite a few expensive Grobner basis computations are needed. We would like to 
limit such com eutanens as much as possible. Of course, it would suffice to actually 
check just — , of these differentials, but for g > 2, this could still be a big number. 

If D is acomponent of the special fibre C,, then Og, p is a discrete valuation ring 
whose residue field Fp is a function field over a finite field. So in principle, checking 
whether >> fii is vanishing on D is an F,-linear algebra problem inside Fp, 
which could be solved much more efficiently. 

If we write w; = fig for polynomials f;, 4; and d is a local generator on 


a sufficiently small affine neighbourhood of the generic point of D for the sheaf 
C/Z p> AS in [vVBom19, Lemma 12], then we would need to determine the residue 


class of o inside Fp. The problem that we now encounter is that both f; and hj; 
could vanish on D with the same order n > 0. We could try to check if f; —@-hj 
vanishes on D with order greater than n, where a ranges over lifts of elements in 
F p, but then we still need to check |F p| elements, which is exactly what we wanted 
to avoid. Although the vanishing of h; on D sometimes happens, it seemed that 
in most cases there were a bunch of components for which this does not happen. 
Therefore, we decided to only use a linear algebra approach on those components 
on which h; does not vanish. We then check in the end if the differential we found 
really vanishes on the whole special fibre. 


Remark 24 An alternative approach would be to try to repeatedly divide a 
uniformiser out of f; and 4; until they both have order 0. However, the difficulty 
with this approach is, that we are doing our computations over Z/p°Z for some 
exponent e (see Sect. 3.2.1), and we cannot divide by the nilpotent uniformiser. 


Instead of actually doing linear algebra in the different function fields Fp, 
we opted for a faster approach, which works in almost all of the cases. We 
precompute a finite set S of (not necessarily rational) random closed points in 
different components of the special fibre, and specialise to these points. This leads 
to the following algorithm. 


Algorithm 25 Input: precomputed set S of closed points on the special fibre of C, 
polynomials f\,..., fe, hi,...,hg such that w; = did for j — 7) eee 
‘ J 


Output: a subspace V of FS, such that all (cj, ...,€g) € FE such that ae Cj; 
vanishes on the special fibre of C are contained in V. , 


254 R. van Bommel 


Step1 For each j = 1,..., g compute the subset S$; = {P € S : hj(P) = O}. 
Compute S’ = S$ \ Ute Sj. 

Step 2 Construct the |S’| x g matrix corresponding to the linear map FY, => FS 
mapping the j-th standard basis vector to (fj(P) - hj; (P)~!) pes. 

Step 3 Output the kernel of M. 


Proposition 26 Algorithm 25 is correct, i.e. the vector space V satisfies the output 
conditions, and the number of field operations needed is polynomial in the input 
size. 


Proof If > ; ¢j@j Vanishes on the special fibre of C, then » joi di must vanish on 
Bee Ss J 


the special fibre. In particular, ar Cj ie must vanish in any point of S’, and (c Djat 
lies in the kernel of the linear map constructed in Step 2. 

The evaluation of the polynomials in Step | is polynomial in the input size: at 
most O(g-|S|-B-log D) operations are needed, where B is a bound for the number 
of coefficients of f},...,4,, and D is a bound for their degree. For Step 2, we 
need O(g - |S’| - B - log D) operations to construct a matrix for the linear map. 
Finally, the Gauss elimination to find generators for the kernel can be done using 
O(max(g, |S’|)>) field operations. 


In most cases in our computation, the vector space V appeared to be at most one- 
dimensional and the full computation to determine whether a differential vanishes, 
had to be done at most once. In almost all cases for which the space had dimension 
greater than |, the prime p appeared to be small. 


3.4 Possible Ideas for Improvements 


Let D be a component of the special fibre of C. We propose two ideas that could 
help in the task of finding the order of vanishing of a function f at D. 

The first idea is to try to remedy the defect described in Remark 24. For example, 
if we know that an integer is 4 mod 12 and we divide it by 2, we cannot tell its value 
modulo 12, but we do know for sure that it is 2. mod 6. In the same way, we can 
reduce the modulus when we divide by a uniformiser to solve the problem described 
in Remark 24. 

The second idea comes from a comparison with methods used in real analysis. If 
you want to see if a C™ function which vanishes in a point P, is vanishing twice, 
you usually compute its derivative and check if that is vanishing. Of course, this 
does not work in characteristic p, but the Hasse derivative could work, see for 
example [Gol03, Sect. 1.3]. If there are methods to compute the Hasse derivative 
more efficiently than the Grobner basis computations that we are currently doing, 
this could lead to much smaller runtimes. 
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Average time per curve (in seconds) | first 200 curves | slowest curve 


Computation of big period matrix 1.20 1.31 
Construction of regular models 0.22 1:22 
Finding a basis of WC/Z.p) (C) 1.03 6665.40 
Total runtime 2.46 6668.03 


Fig. 1 Average runtime per curve 


3.5 Implementation 


The algorithm has been implemented by the author in Magma and used to compute 
the real period for all 66, 158 curves of genus 2 in the [LMFDB], all hyperelliptic 
curves and all but 8 non-hyperelliptic curves of genus 3 in [Suth19]. For the 8 
missing curves, we could not compute the real period due to a bug in Magma’s 
machinery to compute the big period matrix of the Jacobian of these curves. 

The average runtime was about 1.67 s per curve for the 66,158 curves of genus 2. 
For 89 of these curves the computation took more than 60 s, the longest runtime for 
a single curve being 6668 s. For the first 200 curves appearing in the database, and 
for the slowest curve, we have more detailed information showing how the runtime 
has been spent between the different parts of the computation (Fig. 1). 

Almost all of the time spent finding a basis of w/z, (C) was spent on actual 
Grébner basis computations. 


4 Other BSD Invariants 


There have also been recent computations of other Birch and Swinnerton-Dyer 
invariants of Jacobians of genus 2 curves to the [LMFDB]. In this section, we 
summarise them. 

The leading coefficients for the L-functions of these Jacobians have been com- 
puted using methods developed by Bober, Booker, Costa, Lee, Platt and Sutherland, 
to appear in [BBCLPS], which builds upon work of Booker, see [Boo06]. The code 
developed by Costa and Platt is available at [CoP119]. 

Generators for the Mordell-Weil group of the Jacobians have been computed 
using code of Stoll. This is an improvement of the old j-points code. The 
algorithm roughly consists of the following three phases: 


1. try to find a tight upper bound R for rank of the Mordell-Weil group, i.e. we want 
R to equal the rank of J, but at this point we can only prove that the rank of J is 
bounded by R; 

2. search for points in J(Q), until they generate a subgroup of rank R; 

3. saturate this subgroup to obtain the Mordell-Weil group. 


For the upper bound for the rank, the following methods are used: 
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— the 2-Selmer group of the Jacobian and the 2-Selmer set of the Pic! of the curve 
can be computed and used to obtain bounds for the rank, see [Sto01, Creu18], 

— the method of visualisation of elements of LI can be used to further improve 
these bounds, see [CrMa00, Bru04, BrF106], 

— 2-power isogenous abelian surfaces are computed and the same methods are used 
to compute upper bounds for the rank of their Mordell-Weil group, which turned 
out to be better than the upper bound found for J itself in some cases. 


For the search phase, the strategy is to look for points up to a certain height. As 
the canonical height is hard to control, a naive height function is used in this phase. 
The difference between the naive and canonical height is bounded by some positive 
real number 6, that we can actually compute. So to find all points up to canonical 
height B, one has to enumerate all points up to naive height B + 6. In the new 
version of j-points, anew modified naive height function has been used, which 
has a smaller difference with the canonical height than the classical naive height 
function, see [MiiSt16, part IV]. This reduces the size of the required search space 
significantly. 

For the saturation phase, it is important that we have found all points up to 
canonical height ¢ for some e > 0. If all generators for the free part of the subgroup 
we found have canonical height at most H, then the index of the subgroup inside 
the Mordell-Weil group can only have prime factors that are smaller than H We 
can then saturate the subgroup at every prime number p < u and in this way, we 
are sure that we found the whole Mordell-Weil group. The saturation has also been 
made faster using the modified naive height function, see also [MiiSt16, part IV]. 

With this code, we have been able to compute generators for almost all Mordell- 
Weil groups of genus 2 curves in the [LMFDB]. For the few for which we did not 
directly find generators, we found generators for an abelian surface isogenous to the 
Jacobian. The following example is the one for which it was most difficult to find a 
generator. 


Example 27 The Jacobian for the genus 2 curve 900617.a.900617.1 given by 
y* BT 6 ee x)y= x — 65x" 4+ 22497 4. 30x? + x 
is isogenous to the Jacobian of the genus 2 curve given by 
y* = —1110x° — 2790x° — 9315x* + 11160x? + 18315x7 — 9540x — 12765. 
On the latter Jacobian, we were able to find a generator of the rank 1 Mordell- 
Weil group of canonical height 16.281246 using the code of Stoll. We then 
computed the image of this point under the isogeny, and used the modified height 


bound and improved saturation, to prove that the we actually found a generator of 
canonical height 65.124982 on the Jacobian of 900617.a.900617.1. 
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For the computation of the regulators from the set of generators of the Mordell- 
Weil group, we used an algorithm which is based on an approach to decompose 
the height in different local contributions using Arakelov theory, see also [Hol12, 
Miill4, BHM20]. 
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1 Introduction 


The question of which abelian varieties arise as Jacobians of curves has a long and 
rich history. It has classically been investigated over the complex numbers as the 
Schottky Problem, using techniques from differential geometry and Hodge theory. 
In positive characteristic, some of these tools are no longer available, but instead the 
Frobenius endomorphism becomes a formidable weapon. 

In this article, we study g-dimensional abelian varieties which are defined 
over a finite field IF, of odd cardinality. The Fj-isogeny class of such an abelian 
variety A is uniquely determined by the characteristic polynomial of its Frobenius 
endomorphism. This polynomial is called the Weil polynomial of the abelian variety 
and has the following form: 


Za(th=t?8+ayt?8—!4.-.4 dg_t8*! bagt® + ag-1qtS!4---+ayq8!t+q8 € Z[t). 


Given Z,(t), one would like to determine if the isogeny class of A contains the 
Jacobian of a smooth curve over Fg. 

In genus one, the problem is straightforward, as every one-dimensional abelian 
variety is an elliptic curve, which is isomorphic to its Jacobian. For genus two, the 
problem was solved by Howe, Nart, and Ritzenthaler [HNR09] who give an explicit 
classification using only elementary restrictions involving the integers a1, a2, and q. 
Their method relies on three key facts: 


(1) the Jacobian of a curve is canonically principally polarized; 

(2) a principally polarized abelian surface is isomorphic to exactly one of the 
following: a Jacobian of a genus two curve, a product of two elliptic curves, 
or the restriction of scalars of an elliptic curve from a quadratic extension of the 
ground field; 

(3) every genus two curve is hyperelliptic. 


In particular, they take advantage of the canonical involution associated with any 
genus two curve. 

Solving this problem in higher genus appears to be significantly more compli- 
cated. For example, every curve of genus three is isomorphic to either a hyperelliptic 
curve or to a smooth plane quartic curve. Curves of the latter type generically do 
not possess any non-trivial automorphisms, so the arguments of Howe, Nart and 
Ritzenthaler for genus two cannot easily be extended to non-hyperelliptic genus 
three curves. 

In this paper, we focus on a more accessible question, namely, whether a given 
isogeny class of abelian varieties contains the Jacobian of a hyperelliptic curve. 
We do this by studying the geometric configurations of the Weierstrass points of 
hyperelliptic curves, as is done in the g = 2 case, e.g., [MN0O2, Appendix]. In doing 
so, we obtain parity conditions on the coefficients of the Weil polynomials which 
prevent certain isogeny classes from containing the Jacobian of a hyperelliptic 
curve. For instance, in genus three we obtain: 
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Theorem 2.8 Let q be an odd prime power. The isogeny classes of three- 
dimensional abelian varieties corresponding to Weil polynomials of the form 


P+apP+artt+ar+qatrt+Qat+¢ 


with ay = 0 (mod 2) and a3 = 1 (mod 2) do not contain the Jacobian of a 
hyperelliptic curve over Fg. 


In fact, we can perform the same analysis for abelian varieties of any dimension; 
as far as we know, this is the first non-existence result for Jacobians in isogeny 
classes when g > 2 and q is any odd prime power. Indeed, we demonstrate that the 
Weil polynomial of the Jacobian of a hyperelliptic curve of genus g over a finite 
field of odd characteristic must be congruent modulo 2 to a polynomial of the form 
[i=1@¢# — D/@ — 1)? € Fy[t] where 2g + 2 = d, + --- +d, is a partition. We 
call a Weil polynomial admissible if it takes this form modulo 2, and inadmissible 
otherwise. Notice that it is easy to test whether a polynomial is admissible by 
explicitly trying all possible partitions, and an isogeny class is guaranteed to not 
contain a hyperelliptic Jacobian if its Weil polynomial is inadmissible. 

By using asymptotic results on the number of Weil polynomials of a fixed degree 
whose coefficients lie in prescribed congruence classes modulo an integer, it is 
possible to determine the asymptotic proportion of isogeny classes with admissible, 
or inadmissible, Weil polynomials. 


Theorem 3.3 Let c(q, g) be the proportion of isogeny classes of g-dimensional 
abelian varieties over Fg with admissible Weil polynomial. For all g = 2 we have 


' QO(2g +2) 
lim c(q, g) = —~—_., 
q>o 28 
as q ranges over odd prime powers, where Q(2g + 2) is the number of partitions of 
2g + 2 into distinct parts. 


For comparison, notice that for fixed g > 6, it is possible to see by simply 
counting the number of isogeny classes of g-dimensional abelian varieties over Fy 
versus the number of hyperelliptic curves of genus g over F, that actually 0% of 
g-dimensional abelian varieties over F, contain a hyperelliptic Jacobian as gq — 00 
ranges over odd prime powers. However, this counting argument does not provide a 
way to identify which isogeny classes do or do not contain a hyperelliptic Jacobian. 
In contrast, for any fixed g > 1, any isogeny class with an inadmissible Weil 
polynomial is explicitly known to not contain a hyperelliptic Jacobian. Rephrasing 
the theorem above for fixed g > 6, the proportion of isogeny classes of g- 
dimensional abelian varieties over F,, which do not contain a hyperelliptic Jacobian 
but are not found by testing for inadmissible Weil polynomials is Q(2g + 2)/28 as 
q — oo ranges over odd prime powers. As noted in Remark 3.4, this discrepancy 
approaches zero as g grows. 
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On the other hand, over the algebraic closure Fy, determining which isogeny 
classes contain Jacobians remains an open question. In order for our methods to 
shed light in this context, the behavior of admissibility under finite field extensions 
must first be understood. We leave this question for future work. 

The outline of the paper is as follows. In Sect. 2, we recall some results on the 
geometry of hyperelliptic curves and their Jacobians and prove Theorem 2.8 and 
its generalizations to higher genus. In Sect.3, we study asymptotic consequences 
of these non-existence results, deriving Theorem 3.3. In Sect.4, we determine 
restrictions on point counts of curves, allowing us to reprove Theorem 2.8 by 
elementary methods. Finally, in Sect.5, we present experimental data on the 
optimality of our results. 


2 Weil Polynomials Mod 2 


Let C be a hyperelliptic curve of genus g defined over Fy (of characteristic p 4 2) 
with canonical degree two map 7: C > P!. Let W := {@1, @2,..., @2942} be the 
support of ramification divisor of zr, i.e., the geometric Weierstrass points of C. The 
IF,-Frobenius endomorphism Frob acts on W by permuting its elements; we denote 
the cardinalities of the orbits of this action by d;. (Alternatively, W consists of some 
number, r, of F,-places with respective degrees d;, where dj + --- +d, = 2g +2.) 

The multiset dc := {d;} forms a partition of the integer 2g + 2; we call this the 
degree set of the curve C, and by convention we order the d; so that d; < dz < 
tees d,. 

Explicitly, C can be given in coordinates by y? = f(x), where f is a squarefree 
polynomial of degree either 2g + 1 or 2g + 2. Then W consists of the points (a, 0), 
where @ runs over the roots of f, together with the point at infinity if deg(f) = 
2g + 1. Then the d; are precisely the degrees of the irreducible factors of f, along 
with an extra | in the case deg(f) = 2g + 1. 


Remark 2.1 Tf q is small compared to the genus, not all partitions of 2g + 2 may 
arise as degree sets of genus g hyperelliptic curves, due to the finite number of 
irreducible polynomials of any fixed degree in F,[x]. For example, if2g+2 > q+, 
then the partition 2g + 2 = 1+.----+ 1 cannot be realized, as there are not enough 
F,-points for C to ramify over. 


Let J denote the Jacobian variety of C whose elements are degree-zero divisors 
on C modulo linear equivalence. Then the group J[2] of (geometric) 2-torsion of J 
is an F2-vector space that admits the following explicit description. 


Lemma 2.2 The group J[2] of 2-torsion elements of J forms a 2g-dimensional 
vector space over F . Explicitly, this group can be expressed as the vector space 


obtained from ie by considering all vectors with an even number of non-zero 
entries and forming the quotient by ((1, 1,..., 1)). 
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Proof The first claim follows since p # 2. For the second, we argue as follows; cf. 
[Mum07, Corollary 2.11] over C, which extends to any algebraically closed field of 
characteristic unequal to two. Every element of J[2] can be explicitly represented 
either by a divisor 


ev = 9 P—|U\(x), 


PeU 


if the map 2: C — P! is ramified at infinity, in which case the symbol oo slightly 
abusively denotes 1~!(00) € C, or by a divisor 


|U| 
ey = > P— gq rl + 002), 
PeU 


if w is split at infinity, in which case m~!(00) = {00], 009}. Either way, the set 
U C W is a subset of even cardinality, and two divisors, ey and ey’, represent 
the same element of J[2] if either U = U’ or U = W \ U’. In the F -vector 
space V whose standard basis is indexed by W, the above set of representatives 
determine the subspace of vectors with an even number of non-zero entries. Two 
vectors v1,U2 € V yield the same element of J[2] precisely when their sum is 
contained in ((1, 1,..., 1)). The second claim follows. oO 


We will also need the following standard fact about vector spaces, whose proof 
is omitted. 


Lemma 2.3 Consider an exact sequence of vector spaces 


0- Wi — W2 > W3 > 0 


and a linear map T: W2 — W? such that T(W\) © W,. We will also denote the 
induced map T: W3 — W3. Fori € {1, 2,3}, denote the characteristic polynomial 
of T on W; by x(T, W;). Then we have 


x(T, W2) = x(7, Wi)x(T, W3). 


Denote the base-change of C (resp. J) to F, by Cs (resp. J*!2), Exploiting the 
action of Frob on the geometric Weierstrass points W, we obtain the following result 
on the characteristic polynomial of Frob modulo 2. 


Proposition 2.4 Let C be a hyperelliptic curve of genus g defined over Fy. Let 
{dj};_, be the partition of 2g + 2 which records the sizes of the orbits of Frobenius 
acting on the 2g + 2 geometric Weierstrass points. For any prime € # p we have 


det (1 — Frobr| HA(C*8, Qu) = (TI tdi — 1)/i —1)? (mod 2). (2.5) 
i=l 
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Proof We have (cf. [Mil86, Corollary 9.6]) that 


det (1 — Frobt | HAcis, Q)) = det (1 — Frobr | Ha (Js, Q) 
= det (1 — Frobt | Ve(J)), 


where Ve(J) = Te(J) @z, Qe, and where T(J) is the €-adic Tate module of J. By 
the Weil conjectures the polynomials above all have integer coefficients. By taking 
£ = 2, we further have 


det (1 — Frobt | V2(J)) = det (1 — Frobt | J[2]) (mod 2). 
As before, consider the F2-vector space W2 := F58 ave whose standard basis is 
indexed by W and which is acted on by the F,-Frobenius endomorphism Frob. By 
assumption, the action of Frob on W2 can be represented by a block-diagonal matrix 
with r blocks, whose ith block (of order d;) is a cyclic permutation of basis vectors. 


Since the characteristic polynomial of a cyclic permutation of order n is t” — 1, it 
follows that 


r 


x(Frob, W) = | | (r# — 1). 
i=l 


Furthermore, let W; be the codimension-one subspace of vectors with an even 
number of non-zero entries. Then W, is stable under Frob. Moreover, Frob acts 
invertibly on W, and hence also on the one-dimensional quotient W3 := W2/W,, 
so that x(Frob, W3) = t — 1. Applying Lemma 2.3 to the short exact sequence 
0-> Wi > W2 > W/W, — 0 yields 


Frob, W1) = 
x(Frob, W1) i 


Now let W, be the one-dimensional subspace of ((1, 1,...,1)) G W and define 
Ws = W,/W4. Similarly, W4 is stable under Frob with x (Frob, W4) = t — 1 since 
Frob acts invertibly. Applying Lemma 2.3 to the short exact sequence 0 > W4 > 
W, — Ws — 0, we find 


Cat= 1) 


x (Frob, Ws) = rai 


Since, by Lemma 2.2, Frob acts on Ws as it does on J[2], the result follows. oO 


Remark 2.6 It is possible to prove a congruence modulo 2 similar to (2.5) in 
Proposition 2.4 also when the finite field F, is allowed to have characteristic 2. 
In this case, Frobenius acts on Weierstrass points as it does on the étale part of 
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Table 2.9 Weil coefficients modulo 2 and corresponding partitions for threefolds 


Coefficients (a1, 42, a3) (mod 2) | Partition of 8 


(0, 1, 1) {3, 5} 

(1, 1, 0) {1, 1,1, 1, 1, 3}, (1, 1, 1, 2, 3}, {1, 2, 2, 3}, {1, 3, 4} 
(1, 0, 0) {1, 1, 1, 5}, (1, 2, 5} 

(0, 0, 0) {1, 1, 3, 3}, (1, 1, 6}, {2, 3, 3}, {2, 6} 


(0, 1, 0) {1,1,1,1,1,1,1, 1}, (1, 1,1, 1, 1, 1, 2}, (1, 1,1, 1, 2, 2}, 
{1, 1, 1, 1, 4}, (1, 1, 2, 2, 2}, {1, 1, 2, 4}, 
{2, 2, 2, 2}, {2, 2, 4}, (4, 4}, {8} 

d, 1, 1) {1,7} 


J[2]. Since we require odd characteristic throughout the rest of the paper, notably 
in Theorem 3.1, we do not pursue this further. 


Definition 2.7 We call any polynomial of the form (2.5) an admissible Weil 
polynomial modulo 2. Note that the notion of admissibility is independent of q. 


By applying Proposition 2.4 to all possible partitions of 8 we obtain our main 
theorem. 


Theorem 2.8 Let q be an odd prime power. The isogeny classes of three- 
dimensional abelian varieties corresponding to Weil polynomials of the form 


ee war i aot* + a3t? + gant” + q°ait + q 


with ay = 0 (mod 2) and a3; = 1 (mod 2) do not contain the Jacobian of a 
hyperelliptic curve over Fg. 


Proof For each of the twenty-two partitions {dj} of 2g + 2 = 8, we compute the 
corresponding polynomial (t — 1)~? Tie (t% — 1). By Proposition 2.4, these are all 
of the admissible Weil polynomials modulo 2 for Jacobians of hyperelliptic curves 
of genus three. In Table 2.9, we tabulate the coefficients (a1, a2, a3) of the resulting 
Weil polynomials modulo 2 with the corresponding partitions {d;}. oO 


Although Theorem 2.8 only considers the case of three-dimensional abelian 
varieties, Proposition 2.4 applies much more generally. Indeed, for any g > 1, 
we can produce a list of admissible Weil polynomials modulo 2 for Jacobians of 
hyperelliptic curves of genus g, independent of g. The following result counts the 
number of admissible polynomials in terms of partitions. 


Proposition 2.10 The number of admissible Weil polynomials modulo 2 for Jaco- 
bians of hyperelliptic curves of genus g over finite fields of odd characteristic is 
equal to Q(2g + 2), the number of partitions of 2g + 2 into distinct parts, or 
equivalently, the number of partitions of 2g + 2 into odd parts. 


Proof It is well known that the number of partitions of 2g + 2 into distinct parts is 
equal to the number of partitions of 2g + 2 into odd parts; see [NZM91, Theorem 
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10.2]. For the remainder of this proof, our perspective will focus on partitions into 
distinct parts. 

Proposition 2.4 shows how to compute admissible Weil polynomials modulo 
2 using partitions, although many different partitions can correspond to a single 
admissible polynomial. For the purposes of this proof, we will call two partitions 
{d1,...,d,-} and {e1,..., es} of an integer n equivalent if 


i Ss 


I] (‘#7 -1)= I] (t®/ — 1) (mod 2). 


i=1 Jal 


Using Eq. (2.5), it suffices to prove that every equivalence class of partitions of 
2g + 2 contains precisely one partition with distinct parts. 

Observe that every partition of 2g + 2 is equivalent to a partition with distinct 
parts. Indeed, if {d),...,d,+1} is any partition of 2g + 2 where d, = d,41, then 
we can construct another partition {d),...,d-—1, d/.} with d! = 2d,. These two 
partitions are equivalent since 124 — 1 = (¢% — 1)? mod 2. Thus by induction 
on the number of equal parts, we conclude that every admissible Weil polynomial 
modulo 2 arises as a partition of 2g + 2 into distinct parts. 

Now suppose that {d),...,d,} and {e,..., es} are two equivalent partitions of 
2g + 2 into distinct parts. Without loss of generality, we order the parts so that 
d, and e; are the smallest parts of their respective partitions. After expanding the 
polynomials above, we see that the non-constant monomials of smallest degree 
are t@| and f¢!, respectively. This implies that dj = e1, hence {d2,...,d,} and 
{e2,..., és} are equivalent partitions of 2g + 2 — d,. Thus by induction on r, we 
conclude that two equivalent partitions of 2g + 2 into distinct parts are equal. Oo 


Remark 2.11 In contrast with Remark 2.1, every admissible Weil polynomial mod 
2 arises as the reduction mod 2 of a Weil polynomial for a hyperelliptic curve over 
F,, for each q: Since there is at least one irreducible polynomial of every degree, 
we can construct a hyperelliptic curve ramified at points of degree d;, where {d;} is 
the partition of 2g + 2 into distinct parts. 


In Table 2.12, we tabulate the number of admissible and inadmissible Weil 
polynomials modulo 2 for Jacobians of hyperelliptic curves of genus g < 7, i.e., 
the numbers Q(2g + 2) and 2% — Q(2g + 2). 


Table 2.12 The number of admissible and inadmissible Weil polynomials modulo 2 for hyperellip- 
tic curves of small genus 


g 1 2 3 4 5 6 q 

O(2¢ +2) ,) 4 6 10 15 2 32. 
100.00% | 100.00% | 75.00% | 62.50% | 46.88% | 34.38% | 25.00% 

28 — Q(2g+2) |0 0 a 6 17 42 96 


0.00% 0.00% 25.00% | 37.50% | 53.12% | 65.62% | 75.00% 
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3 Asymptotics 


The results in the previous section show that an isogeny class cannot contain a 
hyperelliptic Jacobian if the coefficients of the corresponding Weil polynomial 
satisfy a certain parity condition. Therefore, it is natural to ask how many Weil 
polynomials satisfy any given parity condition, or more generally, how many Weil 
polynomials are congruent to a given fixed polynomial of the correct form modulo 
an integer m. The following theorem answers this question asymptotically for 
abelian varieties of fixed dimension. 


Theorem 3.1 Let g > 2 and m be fixed integers, and let f(t) € Z[t] be a fixed 
polynomial of the form 


f(t) = 078 4ayt?8 | $..-tag_1t8t! +agt® +ag_igt® | +---+aige 't+¢8. 


For a prime power q coprime to m, write é ¢.m(q, g) for the proportion of isogeny 
classes of g-dimensional abelian varieties over Fg whose Weil polynomial is 
congruent to f (x) modulo m. Then 


plu efm(, 8) = me’ 


where the limit is taken over all prime powers q coprime to m. 


Proof This is essentially a theorem of Holden [Hol04, Theorem 5], although we 
present a slightly more general statement here. Specifically, Holden’s theorem only 
considers the case when m = @ is a prime, and the limit is taken over g = p’ fora 
single fixed prime p. However, the more general version of the theorem presented 
above is obtained immediately from Holden’s methods, as follows. Denote by 
I(q, g) the number of isogeny classes of g-dimensional abelian varieties over F, 
and write I'¢m(q, g) for the number of such isogeny classes whose Weil polynomial 
is congruent to f(x) modulo m. With this notation we may write efm(g,g) = 


tf te) Using lattices, DiPippo and Howe obtained upper and lower bounds 


for I(g, g); see [DH98, Theorem 1.2] and [DH00]. Holden employed analogous 
techniques to obtain similar bounds for I¢m(q, g); see [Hol04, Proposition 2.2]. 
Using these bounds, as in the proof of [Hol04, Theorem 5], we find 


ver (qq St)/4m-8 — 2c(g, m)g8 Et V/4-1/2m1-8 
ver (q)q88tD/4 + (ug + 3c(g, D)q88tD/4-1/2 


2 Tpm(@, g) 

1(q, 8) 
7 ver (q)q88tV/4m-8 + (Ug + 3e(g, 1))g884D/4- 1/2 m1 
~ ver (q)qgs8tb/4 — 2c(g, m)g88tD/4—-1/2 


? 
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where vg is a constant depending g, and c(g, m) is a constant depending on both g 
and m, andr(q) = g(q)/q, where g denotes Euler totient function. Letting g > o, 
the theorem follows. Oo 


Recall that, in light of Theorem 2.8 and Proposition 2.10, we say that the Weil 
polynomial of an isogeny class of g-dimensional abelian varieties is admissible if it 
is congruent modulo 2 to a polynomial of the form Tix (= l/(?- 1) € Fo[t] for 
some partition 2g +2 = d| +---+d,, and is inadmissible otherwise. By combining 
Theorems 2.8 and 3.1, we obtain the following corollary. 


Corollary 3.2 Denote by c(q) the proportion of isogeny classes of abelian three- 
folds over Fy with admissible Weil polynomials. Then 


lim c(q) = 75%, 
q-00 


where the limit is taken over all odd prime powers q. 


More generally, by combining Theorem 2.8 and Proposition 2.10, we can 
determine the proportion of isogeny classes of g-dimensional abelian varieties over 
IF, with admissible Weil polynomials in terms of the number of partitions of 2¢ + 2 
into distinct parts. 


Theorem 3.3 Let c(q, g) be the proportion of isogeny classes of g-dimensional 
abelian varieties over Fg with admissible Weil polynomial. For all g = 2 we have 


' O(2g + 2) 
lim c(q, g) = —~—_., 
q>0 28 
as q ranges over odd prime powers, where Q(2g + 2) is the number of partitions of 
2g + 2 into distinct parts. 


Proof By Proposition 2.10, we know there is a set of polynomials S = 
(fi, .--, fo(2g+2)} such that the Weil polynomial of the Jacobian of a hyperelliptic 
curve of genus g over any finite field of odd characteristic is equivalent modulo 2 to 
some polynomial in S. Thus, the result follows immediately from Theorem 3.1. O 


Remark 3.4 For fixed g > 2, the number of isogeny classes of g-dimensional 
abelian varieties over F, and the number of hyperelliptic curves of genus g over Fy 
are asymptotically bounded by g8‘8+!)/4 and q?8~!, respectively, as g > 00 ranges 
over odd prime powers; see [DH98, Theorem 1.1], and [Nar09, Table 1, Corollary 
3.4]. For g > 6, this means that asymptotically 0% of the isogeny classes of g- 
dimensional abelian varieties over F, contain a hyperelliptic Jacobian as q > oo 
ranges over odd prime powers. Comparing this reality to Theorem 3.3, we see 
that the proportion of isogeny classes of g-dimensional abelian varieties that have 
an admissible Weil polynomial but still do not contain a hyperelliptic Jacobian is 


oces?) as q — oo. Notice that this discrepancy gets smaller as g grows. Indeed, 


O(N) ~ 245 exp(r/N73) as N > oo [FS09, Figure 1.9], so it follows that 
O29) 
& 


> O0asg>ou. 
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4 Point Counts 


In Sect. 2 we determined restrictions modulo 2 for Weil polynomials of hyperelliptic 
Jacobians. These Weil polynomials govern the point counts of the corresponding 
hyperelliptic curve C/I, over all extensions of Fy. In this section, we determine 
2-adic restrictions on the point counts of hyperelliptic curves over extensions of Fy 
by more elementary means. Rather than studying the action of Frobenius on J[2], 
we Study its action on the Weierstrass points of the curve directly. In particular, this 
provides an alternative proof of Theorem 2.8. 


4.1 Restrictions on Parity of Point Counts 


As before, let g be an odd prime power, C/I, be a hyperelliptic curve of genus 
g > 1. Recall that we denote the support of the ramification divisor by W and we 
denote by {d;} a partition of 2g + 2, corresponding to the decomposition of W into 
Frobenius orbits. We begin with the following observations on the point counts of 
C over extensions of Fy. 


Lemma 4.1 For eachn > 1, we have #W(Fgn) = idan dj. 
Proof A point of degree d contributes d Fgn-points if d|n, and none otherwise. O 
Lemma 4.2 For each n > 1, we have #C(Fqx) = #W(Fqx) (mod 2). 


Proof The Fgn-points of W are precisely the Fyn-points of C that are fixed by 
the hyperelliptic involution. Since all other points appear in pairs, it follows that 
#C (Fg) = #W(F~) (mod 2). Oo 


Corollary 4.3 For eachn > 1, we have #C(Fgn) = #C (Fy2) (mod 2). 


Proof The numbers n and 2n share the same set of odd divisors, therefore by 
Lemmas 4.1 and 4.2 


#C(Fqr) = #W(E gn) = SS d; = pS dj = #W (Fj) = #C(Ej2) (mod 2). 
i:dj|n i:d;|2n 


Corollary 4.4 For each n, we have 


n-#(i:d;)=n}= Px j(n/d)#C(Fja) (mod 2), 
d\n 


where jt is the Mobius function. 


Proof Using Lemma 4.1 we can restate Lemma 4.2 as 
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#C (Eqn) =) d-#{i : d; = d} (mod 2). 
d\n 


The result follows by applying MGbius inversion. Oo 


Remark 4.5 For C as above, consider the binary sequence 
ac(n) = #C(Fqx) (mod 2). 


Lemma 4.2 and Corollary 4.4 imply that ac determines, and is determined by, 
the numbers n - #{i : d; = n} (mod 2) for all n. These numbers are encoded in the 
degree set dc; they tell us the parity of #(@ : dj = n) when n is odd, and give no 
information when n is even. 

More precisely, Lemma 4.2 and Corollary 4.4 give us a dictionary between the 
sequence ac of parities of point counts and the set 


{d : d is odd and appears an odd number of times in dc}. 


This is consistent with Proposition 2.10. 


This is already enough to prove that some sequences of point count parities are 
inconsistent with the identity }* d; = 2g + 2. For example: 


Lemma 4.6 /fthe genus of C is 3, then we cannot have #C(F,) = 0, #C(Fy3) = 1, 
and #C(F,5) = 0 (mod 2). 


Proof If sucha curve C existed, Corollary 4.4 implies that #{d; = 1} is even, #{d; = 
3} is odd, and #{d; = 5} is even. Thus #{d; = 3} = 1 and #{d; = 5} = 0, which 
contradicts #{d; = 1} being even. oO 


4.2 Restrictions on Point Counts Modulo Powers of 2 


We have just seen some restrictions on the point counts modulo 2 of hyperelliptic 
curves. In this section we will obtain further restrictions modulo higher powers of 2. 

Fix an integer m > 1. Let G be the group {+1} x Gal(Fj2 /F,) of order 
2”+!_ This acts on the F,.-points of C, where the first factor acts by the 
hyperelliptic involution and the second by field automorphisms. We will study 
#C(F q2™) (mod 2”*!) by examining the orbits of the action of G. 


Proposition 4.7 If C is a genus g hyperelliptic curve over Fy, then 
#C(Fgn) = 2(q" + 1) —#W(F gn) (mod ae 


where n = 2% -m > 1 withm odd. 
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Proof We will count the Fgx-points of C by considering the fibers of the hyperel- 
liptic map 7: C(Fgn) > P! (Fyn). Ifxe P! (Fyn), then x ~!(x) can be computed 
by extracting the square roots of f(x), where f is a polynomial of degree 2g + 1 
or 2g + 2. Accordingly, x~!(x) contains either zero, one, or two Fn-points, and 
a preimage of size one occurs if and only if x € W. Further, if n is even and x 
is defined over Fjnj2 (equivalently, if its degree is not divisible by 2°), then the 
preimage must have size two, since quadratic equations y? = f (x) over Fgn/2 can 
be solved over Fgn. Thus, the preimage may have size zero only if the degree of x 
is divisible by 2%. In this case, the Galois orbit of x has size divisible by 2°, so if 
there are Fgn-points above x, they occur in an orbit whose size is divisible by ger 
Hence, to count modulo 2¢+!, we may assume that all unramified Fgn-points of P 
have exactly two preimages defined over Fy, and subtract the number of Fj-points 
of W to obtain the result. Oo 


a 


As with Proposition 2.4, this implies that certain sequences of point counts 
of hyperelliptic curves (equivalently, certain Weil polynomials of their Jacobians) 
cannot be realized. 


Corollary 4.8 Assume that the genus of C is 3. If #C(F,) = 1 (mod 2) and 
#C(F,3) = 0 (mod 2), then we have #C (Fj2” ) =—1 (mod 2+!) for all m > 1. 

Proof Since #C(F,) is odd, we have that #{dj = 1} is odd by Lemma 4.2. 
Analogously, since #{d; = 1} is odd and #C (Fy3) is even, it follows that #{d; = 3} 
is odd and hence must equal one. Altogether, we have d; = 3 for a unique j and 
dj: € {1,2,4} for j’ # j. Hence # (Ups WEF jot) = 5, and so #W(F,2) = 5 


(mod 2”"+!). The result follows by applying Proposition 4.7 ton = 2”, since we 
have g7" =1 (mod 2+!) for g odd. o 


4.3 Reproving Theorem 2.8 


The results in the previous subsections can be used to reprove Theorem 2.8 as 
follows: 


Alternative Proof of Theorem 2.8 Suppose that the statement is false, so that there 
is some hyperelliptic curve C of genus three whose Jacobian has the prescribed Weil 
polynomial. By using Newton’s formulae and [Mil86, Theorem 11.1], we can write 
the point counts #C (Fyx) in terms of the coefficients of the Weil polynomial for all 
k > 1, as follows: 


#C(Fy) =qt+1+a; 


#C(F2) = q? + 1 — aj + 2a0; 


#C(Fj3) = q° + 1 +a} — 3ara + 3a3; 
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#C(Fj4) = q* + 1 — aj + 4ajay — 4aya3 — 2a5 + 4gay; 


#C( ye) = qg +1+ a; - Sajan + Sayay — 5qajaz — 5aza3 + 5q°a. 


We will complete our proof by showing that the restrictions a7 = 0 (mod 2) and 
a3 = 1 (mod 2) on the coefficients force the point counts to fall into the impossible 
cases listed in the lemmas above. First consider the case when aj = O (mod 2). 
Expanding the equations above leads to 


#C(F,) =0 (mod 2), #C(F,2) =2 (mod 4), 


#C(Fj3) = 1 (mod 2), #C(Fj4) = 2 (mod 8), 


#C(F,5) = 0 (mod 2). 


In particular, Lemma 4.6 applies, and we reach a contradiction. 
Now assume that a; = 1 (mod 2). The analogous computation gives 


#C(F,) = 1 (mod 2), #C(F,2) = 1 (mod 4), 


#C(F73) =0 (mod 2), #C(Fj4) =5 (mod 8), 


#C(F,s) = 0 (mod 2). 


Corollary 4.8 applies, and we again reach a contradiction. Oo 


Remark 4.9 The argument above can be automated, and by iterating over partitions 
of 2g + 2, and applying Proposition 4.7, one can rule out Weil polynomials modulo 
2, similarly to Proposition 2.10. We have verified that the sets of unrealizable Weil 
polynomials obtained by these two different procedures agree up to genus g < 10 
by using the Newton identities. Indeed, we expect both sets to match for any g, 
although we are unable to prove it in general. 


5 Experimental Data 


The main results of this paper were inspired by experimental data, which we include 
here to illustrate the phenomena. By performing an exhaustive search, the number of 
Jacobians of hyperelliptic curves in each isogeny class over F, (up to isomorphism 
of principally polarized abelian varieties) has been computed for threefolds and 
prime powers gq < 13 that are either prime or odd. Similarly, by iterating over 
the isomorphism classes of smooth plane quartic curves, the number of isogeny 
classes of abelian threefolds which contain the Jacobian of a smooth plane quartic 
curve has been computed for g = 2, 3, and 5. Both searches were done by Andrew 
Sutherland using the techniques developed in [KS08, KS16], and the data have been 
incorporated in the L-functions and Modular Forms Database [LMF19]; see www. 
LMFDB.org/Variety/Abelian/Fq/. 
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Table 5.1 Types of Jacobians per isogeny classes of abelian threefolds 


Contains Jacobian Total 

q Yes No Hyperelliptic Quartic curve 

2 “108 107 59 B 215 
50.23% 49.77% 27.44% 33.95% 

3 479 198 297 389 677 
10.75% 29.25% 43.87% 57.46% 

5 2611 342 Li25 2471 2953 
88.42% 11.58% 58.35% 83.68% 


By combining these data sets, one can also deduce which isogeny classes of 
abelian threefolds do not contain a Jacobian for g = 2,3, and 5, see Table 5.1. 
These data sets, and more precisely the multisets of virtual point counts modulo 2 
and 4 that can be extracted from this data, provide motivation for Theorem 2.8. 

In Fig. 5.2, we compare the proportion of isogeny classes of abelian threefolds 
over Fy which do not contain a hyperelliptic Jacobian for gq < 13 with the proportion 
of such isogeny classes which are ruled out via Theorem 2.8, giving us some insight 
into the efficiency of Theorem 2.8 as g grows. Examples of isogeny classes which 
do not contain a (hyperelliptic) Jacobian but are not ruled out by Theorem 2.8 are 
given by the following Weil polynomials, cf. isogeny classes 3.3.a_ab_ac and 
3.5.ac_j_awon the LMFDB [LMF19]: 


Wiha af = 2r a 3P 407, 
Z(t) = t© — 20° + 9f4 — 2277 + 4517 — 50 + 125. 


In fact, neither isogeny class contains the Jacobian of any curve. These isogeny 
classes correspond to the row of Table 2.9 with the most partitions. 

In Table 5.3, we present the number of isogeny classes which are ruled out by 
Theorem 2.8, for 17 < g < 31. The data show that the proportion of isogeny classes 
ruled out by Theorem 2.8 quickly approaches 25%, as expected; see Corollary 3.2. 
Furthermore, naive extrapolation seems to indicate that the proportion of isogeny 
classes of abelian threefolds over F, which do not contain a hyperelliptic Jacobian 
approaches 25% from above as g — ov ranges over odd prime powers, while the 
proportion of the isogeny classes that are ruled out via Theorem 2.8 approaches 
25% from below. Similarly, in Tables 5.4 and 5.5 we display the number of isogeny 
classes which cannot contain a hyperelliptic Jacobian by Proposition 2.4, and as 
before, we observe proportions which are already very close to the ones attained in 
the q-limit; see Corollary 3.2. 

To generate Tables 5.3, 5.4 and 5.5, we enumerated all isogeny classes through 
their Weil polynomials: First, we enumerated Weil polynomials of degree 2g, and 
then we filtered by the Honda-Tate condition on its factors, see [Wat69, Chapter 
2], to only keep the ones that correspond to an isogeny class of abelian varieties 
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Fig. 5.2 Effectiveness of Theorem 2.8 


Table 5.3. Isogeny classes ruled out by Proposition 2.4 for genus 3 and small g 


Number of 
isogeny classes 


Theorem 2 


8 


112,283 | 156,589 | 277,517 | 332,166 | 333,695 | 555,843 | 678,957 
Inadmissible by | 27,974 | 39,034 | 69,268 | 82,564 | 83,350 | 138,730 | 169,574 
24.91% | 24.93% | 24.96% | 24.86% | 24.98% | 24.96% | 24.98% 


Table 5.4 Isogeny classes ruled out by Proposition 2.4 for genus 4 and small ¢ 


Number of 
isogeny cla 


132,839 
sses 


705,593 | 2,232,114 | 6,718,947 


15,477,119 


Inadmissible 3856 48,910 | 262,564 


by 


Proposition 2.4 


829,189 


2,513,570 | 5,794,772 


35.17% | 36.82% | 37.21% 


37.15% 


37.41% 37.44% 


37.50% 
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Table 5.5 Isogeny classes ruled out by Proposition 2.4 in genus 5 and small q 


q 3 5 q>o 
Number of isogeny classes 267,465 pi 1,902,325 7 
Inadmissible by Proposition 2.4 137,866 6,286,570 

51.55% 52.82% 53.12% 


of dimension g. We enumerated Weil polynomials using root-unitary,' which 
implements the techniques introduced in [Ked08]. 
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Zen and the Art of Database m®) 
Maintenance spooks 


Edgar Costa and David Roe 


Abstract The last decade has seen a proliferation of online mathematical resources. 
We discuss some of the technical challenges involved in creating and maintaining a 
mathematical database. In particular, we report on the transition of the L-functions 
and Modular Forms Database (LMFDB) between two database systems. We also 
highlight some of the improvements to the LMFDB that we have made as part of 
this transition. 


1 Introduction 


Mathematics has a long history of using computations to aid in forming conjectures 
and searching for counterexamples. In the past few decades, computers have taken 
on a central role, both in performing many calculations and in hosting the results. 
As computational and storage capacity has increased, the size of these results has 
grown to the point where the task of searching and maintaining the data requires 
specialized knowledge. We discuss some of the challenges involved, as well as the 
main tools available to address them. We focus on one particular case study: the 
effort to migrate the database system supporting L-functions and Modular Forms 
Database (LMFDB) [29] from MongoDB [31] to PostgreSQL [38] 

We hope that this article will be useful to mathematicians aspiring to create or 
improve their own databases, to database engineers who are considering switching 
from MongoDB to PostgreSQL, and to mathematicians participating in large 
software projects for whom some of the LMFDB’s lessons might prove useful. 

We begin with a survey of existing mathematical databases to highlight the 
diversity of approaches used to disseminate mathematical data. While some focus 
on communicating theorems and propositions, we pay more attention to those which 
center on examples. Among these, we observe a range of search functionality. 
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Since the LMFDB prioritizes the ability to search by properties of each object, we 
hope that our experience working with MongoDB and PostgreSQL will help other 
mathematical resources improve their capabilities in this area. 

Next, we consider some of the important choices involved in setting up and 
maintaining such a database. We divide this discussion into three parts: how to 
encode and manipulate the data, how the user interacts with the database, and where 
it is physically stored. In each category, we present several options and a brief 
analysis of their pros and cons. 

The technical core of the article is the comparison of MongoDB and PostgreSQL, 
along with our experience in switching between them. We were motivated to 
change systems primarily for performance reasons: many searches that we wanted 
to support stalled on common inputs. We did not see a way to resolve the issue 
within MongoDB. Several missing features in MongoDB also pushed us toward 
shifting: support for arbitrary precision integers and transactions that allow multiple 
commands to be rolled back if there is a failure.! We summarize the major 
differences, including the distinction between a relational database and a document- 
oriented database, and describe some of the features of each that were important 
in different stages of the LMFDB’s history. We also outline our process for both 
porting the data and updating the codebase to use a different system. 

We finish by highlighting some of the benefits provided by performing such a 
major structural change. For example, the process of rewriting a large amount of 
code allowed us to refactor and encapsulate many of the LMFDB’s core functions. 
Moreover, these code changes enabled us to dramatically improve performance and 
add new features, such as verification suites that check for consistency within the 
database. Going forward, the ability to make connections between different kinds of 
mathematics is one of the core aims of the LMFDB, and a relational database will 
help us achieve this goal. 


2 Using Databases in Mathematics 


In order to put our work on the LMFDB in context, we give an overview of some 
other existing mathematical databases. 

We will be primarily interested in datasets consisting of information on math- 
ematical objects, such as specific number fields, graphs or curves, rather than 
on mathematical statements. There are many valuable resources focusing on 
theorems and propositions, from general encyclopedias such as the Encyclopedia of 
Mathematics [16] and Mathworld [48] to the preprint server arXiv [5] to numerous 
domain specific references [33, 14, 1]. However, the design imperatives for such 
projects favor a different type of database than the ones we are interested in. Since 
the content of theorem-oriented sites is likely to be textual, the standard tools for 


'MongoDB 4.0 added support for transactions but was released after we switched to PostgreSQL. 
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internet searching such as Google tend to work effectively. Moreover, because 
humans are directly creating the content, the tools for inputting and managing data 
are different. Wiki [23, 1, 33, 6] and blog [3] software is often used to make it 
easier for many users to be involved in content creation. Finally, when algorithms 
are generating data, it is possible to end up with orders of magnitude more content, 
leading to various challenges of scale not faced by human-created datasets. 

One of the main roles that a mathematical database plays is to record the results 
of interesting computations. Such collections of examples are useful in many areas 
of mathematics, including group theory [49, 7, 22, 15], Lie theory [2, 8], graph 
theory [10], knot theory [28], integer sequences [44], Euclidean geometry [26], 
algebraic geometry [3] and number theory [11, 29]. In each case, much of the utility 
of the database is shaped by how the results are displayed to a user, whether as a 
list of integers with annotations, a picture of a knot or a webpage showing various 
invariants of the object. Some databases [30, 7, 22] are embedded within computer 
algebra systems such as Sagemath [42], Magma [9], or GAP [18], which allows a 
user to perform further computations with the objects in the database easily. Others 
are available online in various formats; we will discuss different methods of data 
distribution in Sect. 3. 

Collecting a large number of examples into one database makes possible a 
valuable paradigm for experimental mathematics: the ability to search through 
examples for a few that have particular specific desired properties or are coun- 
terexamples to a conjecture. In addition to the LMFDB, a number of other 
mathematical databases also have search functionality as one of their core features 
[8, 41, 44, 37, 27, 10, 28]. Searching requires a more substantial infrastructure than 
is needed for just displaying data; the discussion in Sect. 4 highlights some of the 
factors to consider when choosing a search engine. 

A final role played by many online mathematical databases is to provide domain- 
specific exposition surrounding the examples that they host [11, 14, 1,23, 41, 28, 15]. 
The LMFDB aims to provide enough definitions and background for mathematically 
literate users to be able to understand the contents. The main tool for doing so is a 
webpage element called a knowl, which is a bit of context-free knowledge. Within 
the LMFDB, knowls are embedded inside webpages or other knowls, and expand to 
provide more information when clicked. Such exposition is an important part of the 
presentation of a database, but is peripheral to our main topic of the functioning of 
the underlying system. 

Another important aspect of a database project that falls beyond this article’s 
scope is the social and financial efforts required to sustain a collaboration. There are 
numerous examples of projects that are no longer accessible because the authors 
have moved on or passed away. Projects have various ways of reaching out to 
potential developers, ranging from academic workshops and conferences, to the 
standard open source tools such as GitHub [19], to forms allowing users to easily 
submit data [41, 10, 44]. In this article, we limit our attention to the technical side 
of the job: from setting up hosting servers, to the different options for database 
software, to details on how the LMFDB functions. 
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2.1 The LMFDB Project 


The Langlands program, first formulated by Robert Langlands in the 1960s, 
is a set of widespread conjectures aimed at understanding and explaining the 
interconnections between a dizzying array of subfields of mathematics, including 
number theory, representation theory, algebraic geometry, and harmonic analysis— 
and in the twenty-first century its reach continues to expand. The Langlands program 
has been called “one of the biggest ideas to come out of mathematics in the last 50 
years” and “the Grand Unified Theory of mathematics” [17, p. 3]. 

To provide compelling visual and computational displays of the Langlands pro- 
gram “in action,” a database was created called the L-functions and Modular Forms 
Database (LMFDB), available at the website https://www.lmfdb.org/. The LMFDB 
was first conceived in 2007 [12] and remains the object of a significant amount of 
ongoing work by over one hundred mathematicians [29, Acknowledgments]. 

The LMFDB hosts a variety of databases, including some predating the project 
[13, 24, 25], and connects them through the Langlands program; we recommend 
online experimentation or [12] for more details. 


3 Nuts and Bolts 


When it comes to hosting a mathematical database, there are a wide range of options 
available. Until the last several decades, the only choice was to use a print format, for 
example in a book or an article, and rely on a publisher to distribute its content and 
to make it available to the rest of scientific community. Since the publisher takes 
care of most of the technical issues, this method offers the benefit of simplicity 
to the author. Still, distribution in print provides several disadvantages compared 
to digital distribution via the internet. For instance, the act of publishing through 
print is a lengthy process, and the data is less immediately accessible than online 
distribution, putting barriers in the way of the database user. Furthermore, once the 
data is published, it is challenging to address mistakes, add new data or features. 
Finally, it can only accommodate small databases, due to the physical requirements 
to print. 

On the other hand, given that the digital distribution of mathematical data is not 
taken on by publishers, authors have many more choices to make. We break down 
these choices into the following related categories: 


¢ back end—How to encode and manipulate the database in a machine readable 
format. 

¢ front end—How the user interacts with the database. 

¢ hosting—Where to physically store the database. 


We will overview some of the possible options in each of these categories 
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3.1 Back End 


We organize the options for how to encode the database in a machine readable 
format by how easy it is to search for content. 


Human Readable Files Storing the data set in a human readable format, like a 
plain text file or a comma-separated values file, is one of the most straightforward 
possible solutions. It does not require special software to access the data, very 
similar to a table in print. However, it can be cumbersome and impractical to 
search or manipulate the data without additional tools. Note that simple searches 
are possible using tools like grep or the find feature of a text editor. However, these 
methods do not offer the full search capability of a database. 


Application Dependent Files A more advanced solution is to store the dataset 
in an application dependent format. For example, the Small Groups Library [7] 
available in the GAP computer algebra system [18] uses a specific method to encode 
solvable groups. In many cases, such a choice enables direct manipulation of the 
desired objects. For example, using SageMath pickles makes it easy to load the data 
into an active SageMath session. However, the dataset will not likely be efficiently 
searchable, searching for objects with specific properties will usually require to loop 
over every item in the dataset (a sequential search). Moreover, specialized data 
formats are vulnerable to backward-incompatible changes in the software used to 
load them and are more difficult to maintain in the long run. 


Database Structure Finally, one may store the dataset in a database format, i.e. in 
a format optimized for searching. As above, such a solution will require additional 
software for users to manipulate the data, but the data formats are more stable in 
practice since database software is widely used beyond mathematics. There are 
many types of database available, each optimized with different applications; in this 
article, we will focus on two: 


¢ relational database—a collection of ledger-style tables with a fixed schema, 
where the structure of each row is constant. This kind of database has been 
prevalent for decades; major examples include Oracle [35], MySQL [34], 
PostgreSQL [38], and SQLite [21]. 

¢ document oriented database—a schema free database, where each object is stored 
in a single instance in the database, and every stored object can be different 
from every other. MongoDB [31] is one of the most famous instantiations of 
this paradigm. 


In Sect. 4, we will contrast PostgreSQL and MongoDB. 
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3.2. Front End 


There are many choices for how to enable a user to interact with data. Different 
options have varying degrees of accessibility and utility; we describe three possibil- 
ities below. 


Minimal The author may decide to provide direct access to the database. For 
example, the author may make the database files available through email or a 
webpage; or allow direct read access to a SQL server [8]. 


Computer Algebra System If the database is provided in a computer algebra 
system dependent files, these might evolve into being part of algebra system 
itself, either as a standard or an optional package. For example, the Small Groups 
Library [7] is available in GAP and the K3 Database in Magma [30]. This option is 
extremely convenient for users of those software systems but limits their availability 
to others. 


Static Website The user may also interact with the database through a collection of 
web pages displaying the data. These web pages can range widely in complexity. For 
example, one can manually curate each page individually [23, 3] or automatically 
generate them as static web pages [26, 14, 45, 28, 15]. Such webpages can provide 
a very usable interface, but their main weakness is that the html code for displaying 
the data must be duplicated for each object presented, introducing a large overhead 
that becomes prohibitive as the size of the database grows. 


Dynamic Website Alternatively, one can instead use a web application to generate 
each page when it is accessed. This is essential for large datasets, as generating and 
storing a static web page for each object can be infeasible. The LMFDB follows 
such an approach, where the web page displaying the content is generated on the fly 
through a dedicated web application based on Flask [40] and HTML templates, 
and only a small amount of the content is human-curated. Other examples of 
mathematical databases using such an approach include [6, 10]. The main downside 
of this approach is a higher level of complexity when compared to the static option. 


3.3 Hosting 


Finally, we discuss some of the options for where to physically host the front end 
and the database. We focus on how each possibility varies in terms of cost, ease of 
maintenance and scalability. 


Personal Hard-Drive This is one of the simplest options, as it has no initial set up 
cost. However, it severely limits the database’s accessibility to others due to the lack 
of a front end. 
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Personal Web Page If one already maintains a personal web page, a convenient 
option is to also use it to host the database and the front end. However, the scope 
might be limited, as many universities will only allow their users to host static 
content. 


Web Framework Another convenient option is to use a web framework service 
(e.g. GitHub, CoCalc, Wordpress/Wikidot/blogs). For example, some people might 
dump their database into GitHub, while others might also use it to host the front end. 
These options offer a lower initial set up cost with a minimal long term maintenance 
cost, however, the scope is also limited by the framework, since they control what 
software is used to host content. 


On a University/Personal Server This solution provides complete control. How- 
ever, there is both a high initial cost in buying the servers and setting them up. 
Moreover, one needs to maintain the server for the duration of its life: installing 
security updates, dealing with service interruptions, and managing backups. In 
addition, the database’s scope might be limited by the capabilities of the initial 
hardware choices. The LMFDB uses a couple of servers purchased through grants 
to provide a development platform for our contributors, and host the back end and 
the front end for beta.lmfdb.org 


Using a Hosting Service Servers in the cloud [4, 20] provide complete control 
and have no monetary startup costs, but require ongoing payments proportional 
to the resources used. The main benefit is that one does not need to worry about 
maintaining the servers themselves; cloud servers also have the ability to easily scale 
the resources available if necessary. This option is currently used by the LMFDB to 
host www.|mfdb.org. 


4 MongoDB vs PostgreSQL 


In 2009, when MongoDB was initially released as an open source project, it 
presented itself as an exciting new option to easily store mathematical objects. As 
a document-oriented database, where each document was stored independently in a 
JSON-like format, it did not require a schema, a boon when creating new collections 
of mathematical objects where the data requirements only gradually became clear. 
Moreover, Python bindings were available, a language that many computational 
number theorists were familiar with due to its use in SageMath. This setup worked 
well for many years; however, as the LMFDB grew, various shortcomings of using 
a non-relational database became more evident. In March 2018, we decided to 
transition from MongoDB to PostgreSQL, a popular open source implementation 
of the SQL language that grew out of earlier projects founded in the 1970s and 
1980s. As a relational database, PostgreSQL stores data in a structured way, where 
the data is organized in tables, and each table has a specified schema, i.e., each row 
must have the same layout, with the types of columns constant across all rows. 
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4.1 Comparing Two Database Systems 


MongoDB and PostgreSQL offer two different database paradigms. MongoDB, as a 
document-oriented database, is meant to facilitate the storage of unstructured data, 
where the fields and types present in each document can vary within a collection. 
This flexibility is extremely convenient since the data structure can organically 
evolve as the project develops. However, it can also easily lead to errors and 
unnecessary overhead for a structured database. For example, field names were 
kept short in MongoDB to save space, and multiple typos in the field names were 
found during the transition to PostgreSQL. Furthermore, differing layouts across 
documents required more complicated processing code. In contrast, PostgreSQL is 
a relational database with fixed schemas that enable queries across multiple tables. 
Since the LMFDB was already a uniform set of fields in MongoDB (so much so that 
an inventory application was written to extract a schema from the data), creating a 
formal schema was fairly straightforward. Database-level schema support improves 
robustness and performance, and PostgreSQL’s JSONB type allows for schemaless 
columns when desired. 

One benefit of switching has been that storing the data in PostgreSQL requires 
substantially less space than in MongoDB. For example, when converting our 
biggest MongoDB collection Lfunctions.Lfunctions to the PostgreSQL 
table 1func_1lfunctions, we observed a space savings of about 42%, going 
from 194GiB to 113 GiB. These savings translate both to monetary savings (we 
were spending a total of about $2000 per year paying for storage space using 
MongoDB) and to query speed when a sequential search is performed. 

Two main factors contribute to the smaller storage requirements. First, we no 
longer need to (repeatedly) store the name of each field in every document; this 
was a MongoDB requirement, as fields in a document can vary across a collection. 
For some of the LMFDB’s collections, these names accounted for a substantial 
fraction of the storage requirement. In addition to the space benefits, the switch 
to PostgreSQL also relaxes the pressure to minimize the lengths of these names, 
improving readability. 

The second factor contributing to space savings is PostgreSQL’s support for 
arbitrary precision integers. As a mathematical database, the LMFDB frequently 
uses large integers, which had to be stored as strings in MongoDB. In addition to 
the storage consequences, various workarounds were required to sort and search the 
data correctly. The elimination of these hacks has simplified some of the supporting 
code. 

While smaller tables help with query performance, indexes provide a more 
powerful tool for speeding up searches. Indexes facilitate the location of records 
with constraints on a specified set of columns by storing additional data. Having 
such indexes available is key to providing quick search results for typical queries. 
For example, both MongoDB and PostgreSQL support indexes based on binary 
trees, which work well for totally ordered data such as integers and strings. However, 
there is another query type that is not well supported by binary trees: given a column 
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Table 1 Times for counting the number fields K satisfying the given constraint on set of ramified 
primes 


Query Result/total MongoDB?* PostgreSQL 
HK 35 ¢ Rx} 1% 6h 9min 2.938 

#{K : {2,5} C Rx} 18% 10min 2s 6.9s 

#{K : Rx C {2, 3, 5}} 4% 14min 34s 3.42s 

#{K : Rx C {2, 3, 73}} 0.1% 8min 11s 170 ms 


“We used WiredTiger for comparisons since it outperforms MMAP when using normal amounts 
of RAM. 


Table 2 Times for fetching the objects/rows satisfying the given constraint 


Query ; : MongoDB PostgreSQL 
{K :5 ¢ Rx, | discx | < 2000} 1.95s 0.1s 
{E : #0 = 4,rank E = 1, #£E torsion = 1} Imin 58s 1.95s 


representing a set of integers, search for rows that contain a particular set, or are 
contained in a particular set. For example, we store the set of ramified primes Rx 
for number fields K , and we want to be able to search on number fields by specifying 
such constraints on Rx. Even though we supported such searches through our web 
interface, many such searches took several minutes to perform within MongoDB, 
which caused our web front end to timeout. In PostgreSQL, most of these queries 
finish in a couple of seconds, both when the number of results is not large enough to 
justify using an index (the first three rows of Table 1), and when generalized inverted 
indexes (GIN) [43] apply (the last row of Table 1). 

The significant improvement in indexless queries is another considerable advan- 
tage of the transition to PostgreSQL. Such queries are relevant, since non-typical 
queries can be the most interesting mathematically, and since it is impossible to 
construct all indexes in advance because the LMFDB offers the user a lot of freedom 
to select search parameters. We present two such examples in Table 2 (the first for 
number fields and the second for elliptic curves), where the number of results is 
about 1000 in each case. We also note that successful queries are instantaneous on 
a second run, as both databases cache the results. 

While search performance is vital for website usability, the process of uploading 
data and creating indexes is also essential to make the development sustainable 
and to aid in the implementation of new features. PostgreSQL’s COPY FROM 
command allows for the rapid importing of data from plain-text files without a 
Python intermediary, which is useful when the data is generated by some other 
system such as Magma [9]. For example, we imported a 14,398,359 row table from 
a 680 GB text file in about 8 h. With MongoDB we estimate that would have taken 
us several days. Creating indexes in PostgreSQL is also dramatically faster than 
in MongoDB, approximately by an order of magnitude. Additionally, PostgreSQL 
offers a more diverse and mature set of open source front ends and administration 
tools, for example [36, 47], that lower the learning curve for the SQL language. The 
transition has substantially improved database management. 
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In addition to performance improvements, PostgreSQL makes new kinds of 
queries possible. Relational databases allow joining tables, returning values from 
both tables when they share some specified relationship. The existing schemas, 
having been designed for MongoDB, do not take advantage of this feature, but new 
tables on finite groups and on p-adic tori will. 


4.2 Transition 


Once the decision had been made to switch from MongoDB to PostgreSQL, we 
faced two main tasks: to port the data and to update the LMFDB codebase to use 
PostgreSQL. The majority of the work was done by the second author between 
March 2018 and August 2018, with assistance from the first author starting in July. 

Since PostgreSQL requires the specification of schemas, the initial step of the 
data migration involved defining the schema for each table. In practice, most of 
the tables had a schema already, which were made visible by a custom inventory 
module within the LMFDB.” Unfortunately, the accuracy of these schema varied: 
some collections had multiple independent kinds of records within the same table, 
while others had typos in some of the field names. Moreover, the inventory system 
itself proved to reflect the types of some columns inaccurately, since it inferred the 
type from finding a single record. For example, it concluded that a specific field was 
areal number, while some documents had complex values. Eventually, we overcame 
these issues and created a schema for each PostgreSQL table. 

In some cases, there were multiple possible types to choose from for a given 
column. PostgreSQL 9.4 added support for a binary JSON type, which allows for 
unstructured data to be stored in a column. It provided an alternative to arrays 
that could also store an analog of Python’s dictionaries, with strings as keys. For 
simplicity, we initially chose to use this JSONB type instead of arrays, since it 
simplified some of the supporting code. However, after much of the rest of the 
transition was complete, we realized that when arrays are possible (namely, when 
the type is uniform across all entries), they provide some significant advantages. 
Namely, arrays are more space efficient than the JSONB type, and indexes on 
arrays support more operations than indexes on a JSONB column. We are gradually 
transitioning JSONB to array columns where useful. 

We also took advantage of the transition to change some data types that had been 
chosen due to the deficiencies in MongoDB. For example, many large integers had 
been stored as strings since MongoDB lacked an arbitrary precision data type. With 
the switch to PostgreSQL, we were able to use the numeric type instead, which 
simplified code used to sort and compute with the results. 


?The inventory displayed the kinds of documents within each collection, the fields present on each 
type of document, the indexes present in each collection, and both machine and human generated 
information on the types of each field. It was a very useful resource for constructing schema. 
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Since one of the goals of the switch was to improve performance, we made some 
changes to the schemas to help speed up queries. In particular, we split some of the 
largest tables in half, putting columns that were used for searching into a search table 
and columns that were only used for display into an extras table. This significantly 
reduced the search table’s size, improved index efficiency, and sped up indexless 
queries that had to traverse the whole table. Query performance is also affected by 
sort order, and the default sorting on many tables uses four or five columns. We 
added a primary key column to each table, and for large tables we ordered this 
primary key by the table’s standard order. This feature decreased the size of indexes 
and improved their performance since search results could be ordered by a single 
numeric column rather than a combination of many columns. This optimization was 
feasible because the data in the LMFDB changes infrequently, and because each 
search page comes with a default order. 

With the schemas created and export scripts written, the actual changeover of the 
data went reasonably smoothly. PostgreSQL provides a COPY FROM command 
to load data from a text file, which runs very quickly. After several false starts, 
exporting 400 GB of data from MongoDB to text files took three or 4 days; copying 
it between servers on opposite sides of the Atlantic took less than a day, and loading 
it into PostgreSQL took less than a day. 

Adapting the LMFDB codebase to use PostgreSQL took longer. The backend 
code for the LMFDB is written in Python, and the Python bindings for MongoDB 
and PostgreSQL take a very different approach. PyMongo [32] offers a high 
level interface to the database that integrates python data structures. Queries are 
performed by constructing dictionaries that specify the values of certain columns 
and have special keys for adding more complicated constraints such as inequalities 
or boolean combinations of conditions. Psycopg [46], on the other hand, is a thin 
interface that offers the user the ability to execute SQL statements. Constructing 
queries using dictionaries worked well as a model for the LMFDB, since the search 
pages provided many inputs that could be processed independently into a dictionary. 
Moreover, we did not want to require LMFDB developers to learn SQL. Therefore, 
we implemented a high-level interface to PostgreSQL that translated a dictionary- 
based query system similar to PyMongo’s into SQL SELECT statements while 
avoiding SQL injection attacks. This query interface has already proved useful in 
rapidly prototyping another project, researchseminars.org, and we plan to make it 
available as a stand-alone tool. 


5 Benefits 


5.1 Abstraction 


As the LMFDB has grown, the most common paradigm for adding new features has 
been to take a functional section of the website, copy it to a new folder, and modify 
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the templates and backend code. This approach has the benefit of easily producing 
valid code since it is iteratively making small changes to an already functional 
webpage. However, it leads to large quantities of repeated code, which makes fixing 
bugs and adding features much harder, since the same change must be applied in 
dozens of locations. The standard solution is to encapsulate tasks into functions that 
can then be used repeatedly and changed as needed. Of course, the LMFDB has 
used functions like this since the beginning, but often these functions are not written 
in a way that allows them to be used in multiple sections of the website. 

The task of switching from MongoDB to PostgreSQL was made more difficult by 
this low level of abstraction, but it also presented an opportunity. Since the database 
change necessitated changing a large number of files throughout the project, it 
offered a perfect time to make additional changes. We highlight three in particular 
to illustrate the kinds of encapsulation that accompanied the database change: a new 
data-management interface, more robust and uniform handling of search inputs, and 
common tools for creating statistics pages. 

We centralized many of the scripts used to upload data into a suite of data 
management tools. There is now a single object available from the Sage command 
line that has methods for performing a variety of tasks: creating new tables and 
modifying existing schemas, copying data to and from text files, moving data 
between the beta and production versions of the database, accessing PostgreSQL’s 
tools for analyzing slow queries, adding indexes and constraints, and tracking 
statistics as data is changed. The centralization of these tasks allows us to reduce 
code duplication and use best practices to handle errors. 

Each section of the LMFDB has a search page, where users can input constraints 
they would like their curve, field or group to satisfy. Some input boxes require 
an integer or list of integers; others need a label string in a particular format; 
others a rational number. The processing code must process the user provided text, 
raising appropriate errors on invalid input, and transform the set of inputs into a 
database query. Before the database change, we had already begun the process of 
encapsulating this task. Each input box corresponds to a specific field or column 
in the database, and there is a processing function for each type of input (list 
of integers, floating point, etc.). As we switched each part of the LMFDB to 
PostgreSQL, we ensured that it was using these functions. 

Once these queries have been constructed, we need to execute the search and 
pass the results to the Jinja [39] template that actually creates the webpage seen by 
the user. Various parts of this task are shared among all sections of the LMFDB. 
For example, we have to be able to handle errors in user input, errors or timeouts 
in the execution of the query, and jump boxes that allow a user to go straight to an 
object with a particular label. As part of the switch to PostgreSQL, we created a 
search wrapper that performs all of these jobs, and adopted it across the LMFDB. 
In addition to simplifying the code overall, this change made the user experience 
more uniform and allowed for the creation of new features, which will be described 
in the next section. 

Many parts of the LMFDB had a statistics page prior to the database change. 
These pages described the number of objects with certain attributes; for example, 
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a table of how many number fields were available by signature, or a table giving 
the fraction of elliptic curves with a specified torsion structure. Along with the 
database change, we created a class to support the database queries required for 
these statistics, and a common template to display them nicely. This change has 
the benefit that it automates the process for updating these counts when developers 
add data. It also makes it easier to create these statistics pages for more sections of 
the LMFDB, and makes it easier to implement new features such as user-requested 
statistics. 


5.2 New Features 


In addition to the intrinsic benefits of using PostgreSQL, the transition has eased the 
implementation of a number of new features. While many of these would have been 
possible using MongoDB, the improved abstraction made implementing them much 
more manageable. And some, such as dynamic statistics and verifications, rely on 
PostgreSQL’s improved performance and the relational model. 

PostgreSQL supports transactions, allowing one to roll back to a functional 
earlier state if some later action fails. We have built these transactions into our 
interface layer, making it possible to safely modify rows in place, rather than 
creating a copy of the whole table when changes are made. 

The fact that all interactions with the database pass through our interface layer 
allows us to add custom logging. For example, all queries that take longer than a 
threshold are recorded, allowing us to focus on creating the right indexes. We also 
log high-level actions, such as adding new data to a table or changing a schema, 
along with a username to provide a record of what changes have been made to the 
data. These logs help us ensure the quality of our data. 

The statistics infrastructure has enabled a new user interface which we refer to as 
dynamic statistics. Rather than presenting only a pre-selected set of views, the user 
can specify which variables they are interested in and the objects’ constraints. The 
system then generates a display describing the possible values of those variables and 
the number of objects possessing each possible set of values. For example, a user 
can create a table to view how the weight and level of modular forms vary among 
forms with complex multiplication. As of June 2020 we are still in the process of 
enabling this feature throughout the LMFDB. We hope that this feature will provide 
researchers with a new method of interacting with the LMFDB, allowing them to 
look for large scale patterns in these objects in addition to searching for objects with 
particular properties. 

Many browse pages in the LMFDB already had a link to view a random object. 
Because of the new search wrapper abstraction, it was easy to add the ability to 
return a random object satisfying a search query. This feature saves time when 
browsing since it allows you to more quickly reach the homepage of an object 
satisfying a particular constraint, such as CM elliptic curve or a weight one modular 
form. 
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During the transition to PostgreSQL, we modified the search results pages so 
that they do not always provide the total count. The count is provided only if it is 
sufficiently small or has already been cached (though it is available in every case if 
requested). This change dramatically improved response times in some cases, where 
a large number of results makes finding the first few much easier than counting them 
all. 

Mathematical data differ from data in most other applications because it has a 
notion of correctness that can be checked, rather than just providing a record of a 
real-world measurement. Some of these verifications are internal, such as checking 
that the defining polynomial of a number field is irreducible or that the sum of the 
complex embeddings of classical modular matches the trace of the exact newform. 
Others rely on connections with other tables in the database, such as checking 
that invariants match up between an elliptic curve and its corresponding modular 
form. These kinds of verifications across different tables are much faster using 
PostgreSQL, since the relational database model is explicitly designed to search 
across different tables. The switch to PostgreSQL made it possible to create a 
framework to write such consistency checks and run them whenever new data is 
added. 

We are excited about the new features that we added to the LMFDB and we 
believe that the transition to PostgreSQL will aid with the long term maintenance of 
the project. We look forward to how the abstraction layer we have created around 
database interactions will help standardize the user experience across LMFDB. We 
hope others will learn from our experience in their efforts to leverage computation 
and data towards the study of mathematics. 
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Effective Obstruction to Lifting Tate M®) 
Classes from Positive Characteristic hook for 


Edgar Costa and Emre Can Sertéz 


Abstract We give an algorithm that takes a smooth hypersurface over a number 
field and computes a p-adic approximation of the obstruction map on the Tate 
classes of a finite reduction. This gives an upper bound on the “middle Picard 
number” of the hypersurface. The improvement over existing methods is that our 
method relies only on a single prime reduction and gives the possibility of cutting 
down on the dimension of Tate classes by two or more. The obstruction map comes 
from p-adic variational Hodge conjecture and we rely on the recent advancement 
by Bloch—Esnault—Kerz to interpret our bounds. 


1 Introduction 


Let Xo := Z(f) Cc Pe be a smooth complex hypersurface of even dimension 
2r with the polynomial f having algebraic coefficients. The dimension p’ (XC) 
of the space of codimension r algebraic cohomology classes A’(Xc) ®z Q in 
Hn (XC; Q) is called the middle Picard number of Xc. Due to the Lefschetz 
hyperplane theorem, there are no interesting algebraic cycles in codimension 
different from r. The middle Picard number is a difficult invariant to compute, but 
it strongly constrains the geometry, arithmetic, and dynamics of the hypersurface. 
Recently, there have been numerous developments on computing the middle Picard 
number and we review them in Sect. 1.5. 

The main contributions of this paper are Algorithm 4.1 and Proposition 2.22. 
They give practical means for computing upper bounds for p’(Xq) starting 
with the defining equation f. The key idea is to take a prime reduction of the 
hypersurface and compute the obstruction to lifting algebraic (or Tate) classes of 
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the reduction to characteristic zero. We give an implementation! of this algorithm 
in SageMath [Sag20]. In Sect.5, this implementation is used to illustrate the 
strengths and weaknesses of the method. To the best of our knowledge, our 
method gives a superior performance over any other method that has a computer 
implementation. 

There is a practical method, due to van Luijk [vLui07], for bounding the Picard 
number of K3 surfaces, which has now become the standard. When the K3 surface 
X is defined over a number field, the Picard numbers of its prime reductions give 
upper bounds on the Picard number p!(Xq). The method of van Luijk can improve 
these bounds by at most one, via a comparison of two different reductions. However, 
some K3 surfaces with real multiplication do not admit a prime reduction where the 
Picard number jump is less than two [Cha14]. 

Our approach is different, and we do not suffer from these limitations. For 
instance, we do not need two distinct prime reductions to improve the bound. We 
also see in Example 5.9 that our method does not necessarily break down in the 
presence of real multiplication. Additionally, in Example 5.7, we give an example 
of a K3 surface, whose Picard number jumps by six at a prime reduction, and show 
that our method detects this jump. 

Let us recall that for simply connected surfaces such as K3s, the Picard group 
and its image in integral cohomology, namely the Néron—Severi group, coincide. 
These are free abelian groups whose rank equals the Picard number. 

In Example 5.5, we go through the 184,725 smooth quartic K3 surfaces in the 
database of [LS19] to verify computations and benchmark performance. In Sect. 5, 
we give many more examples, involving Picard numbers of quintic surfaces and 
endomorphisms of Jacobian threefolds. 

For the rest of the paper we will work with number fields as opposed to the field 
of complex numbers. In the end we do not lose generality since p’ (Xc) = p” (Xq): 
see for instance Lemma 2.5. 


1.1 Lifting Algebraic Cycles from Finite Characteristic 


We will now sketch the core concept behind our algorithm. To simplify the 
exposition, let us assume f has rational coefficients so that X = Z(f) C peer 


For most prime numbers p € Z the reduction X, := Z(f mod p) C Po will 
Pp 

be well defined and smooth. Fix one such prime p. The choice of such a prime 

introduces an infinite sequence of hypersurfaces 


Z(f mod p") c Pain C115 


that can be said to approximate X. 


'The code is available at https://github.com/edgarcosta/crystalline_obstruction. 
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As we shall soon see, one of the strong points of working with a finite field 
[', is that the determination of algebraic cycles on the variety Xp/I'p is greatly 
simplified in comparison to X/Q. After determining algebraic cycles on Xp, one 
must determine which cycles can be lifted to Z( mod p”) for all n, and eventually 
to X. 

Hodge theory gives an obstruction to lifting cycles from characteristic p, 
namely the map (1.1.3) below. The p-adic variational Hodge conjecture states 
that any unobstructed cycle lifts to X. See [Eme97] and [AMMN20] for more 
details on this conjecture. Bloch-Esnault-Kerz [BEK14] prove that an intermediate 
statement is true: an unobstructed cycle can lift compatibly to all the hypersurfaces 
Z(f mod p”) approximating X. Except when r = 1, it is unknown whether this is 
sufficient for lifting to X. 

As an example, suppose X is a K3 surface. Then the obstruction to lifting a curve 
C Cc X, can be represented by a p-adic integer aj p+ anp*+-..€ pZy. The curve 
C is the restriction of a curve in Z(f mod p”) if and only if a4) = az = --- = 
an—1 = 0, see also [EJ1 1b]. 

The crystalline cohomology is the cohomology theory we need in order to com- 
pute obstructions of cycles. This theory associates a well behaved cohomology to 
smooth and proper varieties over finite fields [BO83, Ber74, Cha98]. A remarkable 
feature of the theory is that the crystalline cohomology of X , can be identified with 
the algebraic de Rham cohomology of X with p-adic coefficients (Sect. 3.1), 


Hetys(Xp/Zp) @ Qy ~ Hig(X/Q) BQ Qp. (1.1.2) 


The algebraic de Rham cohomology is defined as the hypercohomology of 
the usual differential complex (2% /Qr See [Gro66] for the conception of this 
idea and [HM17, §3] for more details. The resulting cohomology groups have 
coefficients in the base field and when tensored with real or complex numbers, one 
recovers the classical de Rham cohomology groups. 

The image of the crystalline cycle class map is a group A’ (Xp) C Hee p/Zp) 
representing algebraic cohomology classes of X ». On the other hand, the cycle class 
map for de Rham cohomology also gives us the group A’(X) C Hin (X /Q) of 
algebraic cycles on X that are defined over Q. Let us denote the Q-spans of A’ (X) 
and A"(X») with A’(X)q and A’(X p)q respectively. 

There is a containment 1: A"(X)Q <> A’(Xp)qQ obtained by reducing subvari- 
eties of X modulo p, see Sect. 2.1.4. Moreover, comparing with complex Hodge 


theory one sees that A’(X) must be contained in the r-th piece of the Hodge 


filtration F” He p/Zp) on Herys(X p/Zp). Consider the quotient map 
Tv. Hetys(X p/Zp) a Heys(X p/Zp)/F Herys(X p/Zp) (1.1.3) 


and note that 7(A’(X)) = 0. The theorem of Bloch—Esnault-Kerz (Theorem 2.7) 
states that any element in the kernel of 
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TA c= | Ar (X p) (1.1.4) 


lifts to a compatible sequence of algebraic cycles on Z(f mod p”) for eachn > 1. 
It is unknown whether every element in ker74 comes from an element in A’ (X), 
see loc. cit. However, when r = 1 we do have A!(X) = kerz, due to [Gro63, 
Theorem 5.1.4]. 

See Sect. 2.1.6 for more details on the obstruction map 14. We also recommend 
the introduction of the paper [BEK14] for the intuition it provides in interpreting 
their main result. 


1.2. Tate Classes As a Substitute for Algebraic Classes 


Tate’s conjecture gives a hold on the space of algebraic cohomology classes A’ (X p). 
The Frobenius action on X, induces an action on crystalline cohomology by 
functoriality and, therefore, a map Frob: Hee p/Zp) > He gx p/Zp). Tate’s 
conjecture states that the (,-span of A” (Xp) coincides with p’ -eigenspace of Frob. 

Write T’(Xp) := ker(Frob—p’). The elements of T’(X,)) are called Tate 
classes. The inclusion A’(Xp») C T’(Xp) is always true and A’(X))Q @Q 
Q, = T'(Xp) is the Tate conjecture. This conjecture is known to hold for K3 
surfaces [Chal3, Mad15, KM16]. 


Consider the restriction of the quotient map (1.1.3) to the space of Tate classes, 
UT — H|Tr (Xp): (1.2.1) 


By the discussion above, A’(X) is contained in kerzy and one can deduce the 
inequality 


dimg A’(X)q < dima, kerzr. (1.2.2) 


Our Algorithm 4.1 describes a practical method to bound dima, kerry. We 
should point out that the codomain of zr is typically small when viewed as a Q),,- 
vector space but infinite dimensional as a (Q-vector space. If X is a K3 surface then 
the codomain of zr is | dimensional. Therefore, as it stands, the bound (1.2.2) can 
be underwhelming. Ideally, one should seek to recover a €)-structure on T’(X p) 
containing A’(X,,) to leverage the fact that dimg Q, = ov. In this way, the 
codomain of 27 would have the capacity to obstruct arbitrarily many classes. We do 
not take this approach in this paper, but we say more about it in Sect. 1.4. 

Instead, we make use of the Galois module structure on Tate classes to 
improve (1.2.2). A slight modification of the discussion above allows one to pass 
to the algebraic closures of the base fields. This allows us to compute bounds on 
dime A’ (XM) = dimg A’(Xc). Furthermore, these bounds on dime A’(Xc) 


can be computed without leaving the comfort of Helys(X p/Zp). We explain how 
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to enlarge the base field in Sect. 2.3. In the process, we show that algebraic cycles 
appearing over different base fields can be obstructed independently. This is the 
content of Proposition 2.22, which improves over (1.2.2). 


1.3. A Note on Using Finite Approximations 


A p-adic approximation of the Frobenius action on crystalline cohomology 
Frob: Herys(X p/Zp) > Hee /Zp) can be obtained as in [AKR10, Cos15, 
CHK19]. Using this approximation, we find a vector space approximating the space 
of Tate cycles T”(X). We work in a coordinate system that respects the Hodge 
structure, see Sect.3.4, and allows us to approximate the obstruction map 77 on 
T’ (Xp). With careful management of precision, we can compute a rigorous upper 


bound on the dimension of the kernel of 7, see Sect. 4.1.10. 


1.4 A Limitation and the Need for Integral Structure 


There is a shortcoming of our approach. Even when the Tate conjecture holds, and 
even when A‘(X) is characterized in A’ (X) by the kernel of m4, the inequality 
dimg A’(X)Q < dima, ker wry can be strict. This is because A’ (X)q is a rational 
vector space, while the obstruction map can be irrational, see Example 5.8 where 
Proposition 2.22 cannot help. 

This issue can be circumvented to a large extent if one can identify the image 
of A’(X,) inside T’(X,). It is a well-known shortcoming of the Tate conjecture 
that, unlike the Hodge conjecture, there is no description of this image, even 
conjecturally. 

Suppose we can compute (or approximate) any Z-lattice A inside T’(X) that 
would contain A’(X,). Using pLLL [IN17], the p-adic version of LLL [LLL82], 
we can get a good guess on the “integral” kernel of zr, restricted to A. This would 
sacrifice rigor in return for what is most probably the correct value of p’ (Xc). 

This idea can be compared to the one used in [LS19] in the setting of complex 
periods. There, integral Betti cohomology serves as the Z-lattice A inside the 
complex Betti cohomology. 

In the present paper, we do not consider the problem of identifying a lattice A C 
T’(X,). Nevertheless, we tried to set-up the theory in a way that anticipates this 
development. We addressed issues of torsion in integral crystalline cohomology and 
the state of knowledge regarding the properties of the obstruction map with integral 
coefficients, see Sects. 3.3 and 3.4. 
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1.5 Previous Approaches 


Given the importance of A’(X), several techniques exist in the literature for 
obtaining information on A’ (X) or p’ (X) for a given X. 

For example, the authors of [PTvL15] provide an algorithm for surfaces con- 
ditional on the computability of the étale cohomology with finite coefficients. 
In [HKT13], the authors provide an algorithm for K3 surfaces of degree 2 condi- 
tional on an effective version of the Kuga—Satake construction. Another algorithm 
to compute the geometric Picard number of a K3 surface, conditional on the Hodge 
conjecture for X x X, is presented in [Chal 4]. 

Often, as in the algorithms mentioned above, one obtains a lower bound for p(X) 
by exhibiting divisors explicitly. However, there is no known practical algorithm to 
do this in general. Nonetheless, if X has some additional structure, practical methods 
may arise. For example, when X is a product of curves [CMSV 19], is a quotient of 
another variety by finite group [Shi86], or is an elliptic surface [Shi72, Shi90]. 

The specialization homomorphism of algebraic cycles 1: A!(X) <> A!(X p) 
is used frequently to compute Picard numbers of surfaces. For instance, one may 
compare the lattice structure on A!(X p) for two different primes to limit the image 
of ¢ [vLui07], or use the Artin—Tate conjecture for surfaces [K1007]. One can also 
view t as a morphism of two Galois modules, as was done in [EJ] 1a], see also 
Proposition 2.22. Alternatively, when explicit elements of A! (X p) are known, one 
can rely on their geometry to show that some of them cannot be lifted. This becomes 
a powerful tool in the absence of torsion in the cokernel of 1, see [EJ1 1b]. 

The methods outlined in the previous paragraph are strongest when the rank jump 
between A!(X) and A!(X p) is at most one. However, Charles [Chal4] proved 
that some K3 surfaces may never admit a prime reduction where dim A!(X p) 
dim A!(X) < 1, see Example 5.9. 

We should highlight the difficulty in computing Picard numbers of surfaces. 
For instance, only recently did Schiitt [Sch15] obtain the set of values that can be 
attained as the Picard number of a quintic surface. We still do not know this set 
for sextic surfaces. See the introduction and Sect. 2 of loc. cit. for a comprehensive 
overview. 

The papers [Ser19, LS19] tackle the computation of the middle Picard number 
of a hypersurface using complex transcendental methods. That method seems very 
reliable (Example 5.5), but proving the result of its computation can be challenging, 
see for instance [MS]. 


1.6 Overview 


In Sect. 2, we present the theoretical framework of our method. This includes an 
overview of the problem of lifting algebraic cycles from positive characteristic. 
We state the Bloch—Esnault—Kerz theorem (Theorem 2.7) and show how to use 
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it in computations, to constrain the liftability of certain Tate classes. In Sect. 3, 
we recall how to effectively compute in the crystalline cohomology of a smooth 
hypersurface. Here, we slightly strengthen the known results to better handle 
torsion. In Sect. 4, we present the main algorithm of this paper (Algorithm 4.1) and 
clarify each computational step. In Sect. 5, we explore examples that illustrate the 
strengths and weaknesses of the method. We also demonstrate the usage of the code 
crystalline obstruction. 


2 Lifting Algebraic Cycles from Positive Characteristic 


In this section, we will be working with a smooth projective variety X defined over 
a number field K. The goal is to constrain the dimension of the span of algebraic 
cohomology classes on Xx% := X XK K, for an algebraic closure K of K. 

We begin with a review of the core concepts we will use. In Sect. 2.1 we setup 
notation for the passage to a finite field from K. We then recall the statement of 
Tate’s conjecture for finite fields in Sect. 2.2. 

The later subsections are intended to reposition these concepts to simplify 
computations. In particular, we do not want to extend the base for crystalline 
cohomology when computing. Section 2.3 makes the first simplification that allows 
us to find “eventual Tate classes”, see (2.3.2). In Sect.2.4, we show that the 
obstruction map can be studied at the level of eventual Tate classes and the 
obstruction map can be applied seperately on elements that appear at different levels 
of field extensions. The conclusion, Proposition 2.22, multiplies the extent to which 
we can obstruct classes. 


2.1 Passage from Number Fields to Finite Fields 


Let K be a number field with ring of integers Ox. Fix an unramified prime ideal 
p C Ox. Localizing and completing Ox at p we get a local field Q, and its ring of 
integers Zp. We will continue to write p for the maximal ideal of Zp. Let k be the 
residue field at p and p the characteristic of k. 

The ring of Witt vectors W(k) of k and Zp are canonically isomorphic. In fact, 
for some n there is an isomorphism k ~ Fp» and W(k) is determined uniquely 
up to isomorphism as the unramified extension of degree n of the ring of p-adic 
integers Zp. 


2.1.1 Good Reduction 


With X — Spec K a smooth projective variety, assume that p C Ox is chosen 
so that that X has good reduction at p. That is, we assume that there is a regular 
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scheme X — Spec Zp, smooth over the base, such that XQ, = X xxK Q, is 
identified with the generic fiber X xz, Q,. We will write Xp for the special fiber of 
X, and Xp = Xp Xx k for its base change to an algebraic closure k of k. 

We will also use “thickenings” of Xp, namely X, := X @z, (Zp/p") for each 
n > 1. Let X be the formal scheme obtained by completing X along the special 
fiber Xp. 


2.1.2 The Specialization Map on Subvarieties 


For any r > 0 and a reduced scheme Y, the algebraic cycle group 2’ (Y) of Y is 
the free abelian group generated by codimension r subvarieties of Y. When Y is a 
formal scheme, we use Z’ (Y) to denote formal subschemes of codimension r. Let 
us write Z"(X)’ C Z"(X) to be the free abelian group generated by subvarieties in 
X that are flat over the base Zy. 

There is an isomorphism 2” (XQ,) — 2'(X)' which maps each subvariety V C 
X to its closure in X. The inverse of this map is given by intersecting a flat variety 
in X with the generic fiber X Q,- There is also a restriction map 2" (X)’ > 2’ (Xp) 
obtained by intersecting a subvariety of X by the special fiber X,. Composing the 
isomorphism above with the restriction map gives the specialization map [Ful98, 
§20.3] 


sp: Z"(XQ,) > 2 (Xp). (2.1.1) 
Let us point out that the specialization map factors through 2” (0), 
2"(Xa,) > 2 Oy 7 a (2:43) 


though the first map does not need to be surjective. 


2.1.3 A Hodge Filtration on Crystalline Cohomology 


For each r > 0, the de Rham cohomology Hip (X/Zp) of X comes with the Hodge 
filtration F°H),p(%/Zy). Using the Berthelot comparison isomorphism [Ber97, 
Shi02] (see also Sect. 3.1), 


®: Hip (X/Zyp) aes Heys(Xp/Zp), (2.1.3) 
we can carry the Hodge filtration over to the crystalline cohomology of Xy. Let us 
point out that the resulting filtration is not intrinsic to X, but depends on the model 
X/Zy. The filtration modulo p is, however, intrinsic to Xp. 
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Definition 2.1 Let FY Horys(Xp /Z») denote the filtration induced by the 
Hodge filtration on de Rham cohomology carried over by the comparison 


isomorphism (2.1.3). 


2.1.4 Cycle Class Maps 


There are cycle class maps cag [Har75] and Cerys [GM87] (see also [BEK14, §8] for 
a review of the crystalline cycle map with integral coefficients) making the following 
diagram commutative: 


Z"(X) ————+ 2"(X)!’ ———_+ 2"(Xy) 


[ear | [eos 


We define the middle vertical arrow so as to make the square on the right commute. 
The map © on the bottom right is the Berthelot comparison map (2.1.3). The map 
on the bottom left is the composition: 


Hap (X/Zp) > Hea (X/Zp) @Z, Qy > HaR(X/Qyp)- (2.1.5) 


Definition 2.2. The image of the cycle class maps, in the appropriate cohomology, 
will be denoted with A’(X) and A’(Xy). Their tensor with Q are denoted by 
A’ (X)q and A"(Xp)qQ, respectively. The elements in the image of the cycle class 
maps, or elements in their Q-span, will be called algebraic cohomology classes. 


Lemma 2.3 The maps above give an injective homomorphism t: A'(X)Q @ 
A’ (Xp)Q. 


Proof The cycle class maps are compatible in the sense that the diagram (2.1.4) 
commutes after tensoring with Q. Furthermore, the horizontal bottom arrows are all 
isomorphisms once tensored with Q, and thus we have injectivity. oO 


Remark 2.4 In fact, even without tensoring with Q, we can obtain an injection. 
The specialisation map A’(Xx) — A’ (Xp) preserves the intersection pairing, 
see [Ful98, Corollary 20.3]. Using that the polarization maps to the polarization, 
and using the Hodge index theorem on A’ (Xz), we conclude that no element can 
map to zero. 
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2.1.5 Dimensions of the Space of Algebraic Cycles Over Different Fields 


Let Pperj(XC) denote the dimension of the space of codimension r algebraic 
cycles A’(Xq) in the Betti cohomology Hi (X c, C) of the associated complex 
manifold. 

Let L/K be a field extension of K. Define pjp(X_,) as the dimension of the 
algebraic cycles in the de Rham cohomology of Xy := X xx L. 

Let K C C be the algebraic closure of K <> ©. Let K > Q, be a localization 
as in Sect. 2.1 and let Q, be an algebraic closure of Q,,. Then we have the following 
series of equalities: 


PReti(XC) = Par (XC) = Par(XR) = Par(XQ,)- (2.1.6) 


The first equality follows from the standard comparison isomorphisms. The other 
equalities result from the following well-known fact. 


Lemma 2.5 Suppose K is algebraically closed and characteristic zero. Let L/K 
be a field extension with L algebraically closed. Let Xx /K be a smooth projective 
variety and X 1 its pullback to L. Then for every r = 0, Pip(XK) = Pjp(Xz)- 


Sketch of Proof In characteristic zero, the cycle class map cgr is well defined, and 
it factors through the Chow groups CH” (the algebraic cycle group 2’ modulo 
algebraic equivalence). But the Chow groups of Xx and X, are canonically 
isomorphic. 

The inclusion CH’ (Xx) <> CH"(X_) is induced by pulling back varieties in 
Xx to X_. In the reverse direction, take a subvariety V C X ,. Without loss of 
generality, we may assume L is the field of definition of V over K. In particular, L 
now has finite transcendence degree over K. We can find an affine K-variety Y with 
function field L over which V extends to a flat family of varieties V > Y so that 
V<— Xx Xx Y isaclosed immersion. Any fiber of V > Y over a K-valued point 
defines an element in CH’ (Xx). oO 


2.1.6 The Obstruction Map 


The goal here is to give a partial description of the image of the inclusion 
u: A"(X)Q — A’(Xp)q. That is the image of the composed map 2’(X) > 
Zs (Xp) > Het p/Zp). The following theorem instead allows us to describe the 


image of the composed map 2” (X) > 2 (Xp) > Helys(Xp /Zy) (see also (2.1.2)). 
Definition 2.6 An algebraic cycle € € A’(Xy) is unobstructed (with respect to X) 
ifé € Fee (Xp/Zp), with the filtration defined in Definition 2.1. A cycle is 


crys 
obstructed otherwise. 
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Note that unobstructed cycles form a subgroup. The terminology is justified by 
the following theorem of Bloch, Esnault, and Kerz, which states that modulo torsion, 
the group of unobstructed cycles, can be lifted to ca 


Theorem 2.7 ([BEK14, Theorem 1.3]) Suppose X/Zy is smooth and projective. 
For each r > O and p = char k > dimXy + 6 the image of the composition 
Zr (X) @Q > 2" (Xp) @ Q — A" (Xp)Q coincides with the group of unobstructed 
cycles. 


Definition 2.8 The obstruction map of Xy with respect to X is the quotient map 


(Xp/Zy) > H2” (Xp/Zp)/FoHe,,(Xp/Zp)- 


crys crys 


aX: aa 
If it is clear from context, we may tensor zx with Q, without changing notation. 
Restrictions of zr, to a subspace V of cohomology will be abbriviated to zy. 


Remark 2.9 If we are given Xp but not X the model X/Zy is not unique. The 
choice of a model X/Zy impacts which cycles are obstructed. 


Remark 2.10 A theorem of this nature—in particular, the ability to define the 
Chern class map for line bundles—was the original motivation for the definition 
of crystalline cohomology [Gro68, $7.4]. For line bundles, this goal was realized by 
Berthelot and Ogus in [BO83]. 


2.2 Finding the Image of the Chern Class Map via Tate’s 
Conjecture 


Due to its computational complexity, we would like to avoid computing and 
representing actual subvarieties in X or even Xp as much as possible. On the other 
hand, the @,,-span of algebraic cycles in cohomology has, at least conjecturally, a 
computationally tractable description as Tate classes [Tat66b, Mil07]. 

Recall that there is the arithmetic Frobenius map Xp — Xy over k. As 
crystalline cohomology is functorial, the relative Frobenius map induces a map on 
cohomology: 


Froby: H2" (Xp/Zp) > H2” .(Xp/Zp). (2.2.1) 


an crys 
Definition 2.11 The integral Tate classes are the q~” Frob,-fixed elements of 
crystalline cohomology, they form T’ (Xp)int C Hy lt p/Zp). The fixed elements 
of q7" Frob, in Hoe Xp /Zy) ® Q, are Tate classes and they form the space 
T’ (Xp). 


Conjecture 2.12 (Tate Conjecture) For each r, algebraic cycles span T' (Xp); 
that is; A’ (Xp) ®z Q, ~ T' (Xp). 
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Remark 2.13 The stronger statement A"(Xp) @z Zp = T"(Xp)int is called the 
integral Tate conjecture. This version is often false due to torsion, though it may be 
true modulo torsion. In any case, when r = 1, then the usual Tate conjecture implies 
the integral Tate conjecture [Mil07]. 


Remark 2.14 Tate himself proved the Tate conjecture for abelian varieties over 
finite fields [Tat66a]. More recently, the Tate conjecture was proven for K3 
surfaces [Chal4, Mad15, KM16]. For an overview of the Tate conjecture over finite 
fields, as well as recent progress on it, we recommend [Mil07, Tot17]. 


2.3 Enlarging the Base Field 


Given the smooth projective variety X/K we want to compute the dimension of 
A’ (Xz), with K an algebraic closure of K. If Xp is our choice of finite reduction, it 
is likely that A’ (Xz) will not map into T’ (Xp) and we need to enlarge the residue 
field k. The dimension of the space of Tate classes will increase as we enlarge k 
but will eventually stop growing. This terminal dimension, as well as the terminal 
dimension of the unobstructed Tate cycles, can be computed without ever enlarging 
the base field. We will explain how to do this here, paying attention to computational 
limitations. 

Let x2, € Q[f] be the characteristic polynomial of g~" Frob,, a scaling of the 
Frobenius map (2.2.1). Factorize this characteristic polynomial over Q[t] as follows 


xor(t) =h TL ei”, (2.3.1) 


where, for each i, the polynomial ®;(t) € Z[t] is the i-th cyclotomic polynomial, 
i.e., the minimal polynomial of any primitive i-th root of unity, the exponents y; > 0, 
and no root of A(t) is a root of unity. Set u := Icm{i | y; 4 O}. 

Consider now the following subspace of Hoys(& p/Zp) @ Qy, 


T" (Xp) :=ker (Frobé —q'’) . (2.3.2) 


Elements of 7” (Xp) may be viewed as “eventual Tate classes” as they will span the 
Tate classes when the base field k is enlarged, as its dimension will match the rank 
of A((Xp)z) , as shown below. 


Proposition 2.15 Using the obstruction map (Definition 2.8) we obtain the follow- 
ing bound: 


dim A” (XP)Q < dim ker 7 |Fry,.) (2.3.3) 
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Proof Crystalline cohomology base changes like the de Rham cohomology. How- 
ever, the natural action of Frob, on the extended scalars will not be linear. Instead, if 
k’/k is a base extension then the natural action of Frob, will correspond to the linear 


extension of Frobl*! from Heri p/Zy). It follows that if [ k’:k] = u then the span 
of 77 (X p) will give the new space of Tate classes. Further extensions of the base 
field will not increase the dimension of the space of Tate classes. Observing that the 
obstruction map zr extends linearly with base change, we conclude the proof using 


Theorem 2.7. oO 


In practice we will approximate Frob, to a few p-adic digits (often 5—20 digits). 
Increasing the precision is costly and each power of Frob, will lose some of that 
precision. Considering that u maybe quite large, we will compute Tk p) in the 
following way which requires taking at most the v-th power of Frob,, with v := 
max; {deg ®; | y;  O}. 


Proposition 2.16 Let T/(Xp) := ker (®;(q~" Frob,)). Then T’(Xp) = 
Dio T; (Xp), where the sum is taken over i with y; # 0. 


Proof The restriction of g~" Frob, to T° p) is annihilated by the polynomial r“ — 
1. Therefore, its minimal polynomial will have only reduced cyclotomic factors. 
Now apply the primary decomposition theorem from linear algebra [HK71, §6.8, 
Theorem 12]. oO 


2.4 Improving the Obstruction Using the Frobenius 
Decomposition 


The space of “eventually Tate classes” fT = T’ (Xp) from (2.3.2) admits the 
Frobenius decomposition T= Q; T; of Proposition 2.16. It is tempting to obstruct 
each 7; individually in order to improve the upper bounds we can get for the 
dimension of A’ (Xz). Assuming the full Tate conjecture, as described below, we 
show that this is possible here. 

The full Tate conjecture assumes, in addition to the Tate conjecture, that the 
intersection product on algebraic cohomology classes is non-degenerate [Mil07]. 
This additional requirement always holds for surfaces. In particular, K3 surfaces 
over finite fields continue to satisfy the full Tate conjecture [Chal3, Mad15, KM16]. 

For this section let A := A’(X¢z)qQ, B:= A’ (Xp)Q and H := Helys(Xp/Zp) ® 
(,. We warn the reader that B does not map into H. Pick a finite extension k’/k 


where all algebraic cycle classes of Xp can be defined. Recall that W(k’) is the 
ring of Witt vectors of k’ and Q(k’) is its fraction field. Then g~" Frobx maps B 
equivariantly into 


H! = Hyys(Xp Xk K/W(K)) Owe OK) ~ H @Q, OK). (2.4.1) 
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The natural action of Frob, on H’ is not the one that linearly extends g~" Frob, from 
H. Overcoming this non-linearity is the main technical obstacle in this section. Let 
a: Q(k’) > Q(k’) be the field homomorphism induced by the Frobenius map of k 
on k’. The natural Frob, action on H’ is the o-linear extension of the Frob, action 
on H. 


Lemma 2.17 The image of the inclusiont: A’ (Xx) — A" (Xp) is invariant under 
the action of q~" Frobx. 


Proof We need to find an action on the left hand side that commutes with the 
Frobenius action on the right hand side. Note that there is a finite Galois extension 
K'/K such that A’(Xz) = A’(Xx’). We can also choose a prime q of K’ lying 
above p for the reductions. It is standard that one can lift the Frobenius action into 
the subgroup of Gal(K’/K) that fixes q [Mil17, § 8]. Oo 


Let x4, XB € Q[t] be the characteristic polynomials of the g~” Frob, action on 
A and B respectively. We will write x# € Q[t] for the characteristic polynomial of 
q_’ Frob, on T CH from (2.3.2). We know that q_" Froby is of finite order on each 
of these spaces, therefore we can factor these characteristic polynomials as follows: 


xa =|] 9%", XB = IT". and x7 =[] 07, (2.4.2) 


L L L 


where ®; € Z[t] is the cyclotomic polynomial for any primitive i-th root of unity. 
Note that we want to determine the dimension dim A which equals }°; a; deg ®;. 

Decompose these spaces into A = Q, A;, B = Q; B; and T = Q, T; as in 
Proposition 2.16, with A; = ker (®; (q7" Frob,)) and so on. 


Lemma 2.18 For each i we have a; < B; < y;- 


Proof The first inequality a; < 6; follows from Lemma 2.17 since Bj C Aj. 
The inequality 6; < y; is more subtle, but standard. Either one can argue as in 
Proposition 2.19 below, or deduce it from the Weil conjectures in comparison with 
the étale cohomology with Q,-coefficients. oO 


For each i, let T/ ~ T; a, Q(k’) be the span of T; in H’, see (2.4.1). 
Proposition 2.19 Assuming the full Tate conjecture for X, we have T; 0 B = Bj. 


Proof For each i, let k;/k be a degree i extension and let H; be the crystalline 
cohomology of X yi := Xp Xx kj. The operator gr Frobi, —1 is linear on H; and it 
annihilates A” (Xpi). Moreover, B<j = D j<i B; coincides with A’ (Xyi) because 
both sides are the fixed elements of A” (Xp) with respect to the absolute Galois 
group of kj. 

Having assumed the full Tate conjecture, we may conclude that B<; is a Q- 
substructure for T!,. In particular, elements of B are Q(k’)-linearly independent in 
the ambient space H’. This implies the equality is , 1B = Bg;. The proposition 
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for i = 1 follows immediately. Fix i and assume that the proposition holds for each 
J <i. 

Although the Frob, action is not Q(k’)-linear on T/, the space T/ is nevertheless 
a Q-vector space that is Frob, invariant. Thus, the intersection T; 1 B is a Frob, 
invariant (-subspace of B<;. In particular, it admits a decomposition into a direct 
sum, with each component lying in one B; for j < i. But distinct spaces T’ are 
disjoint and each T; contains B; when j < i. It follows immediately that the 
intersection T/ M B rnin lie in B;. The equality follows. Oo 


For each i € Zo define the integer 7; as 


= la i (2.4.3) 


deg ®;, if mr, 40. 


Corollary 2.20 Assuming the full Tate conjecture, dime A’ (Xz) < dima, C= 
par ‘j. 


Proof With notation as in (2.4.2) we recall dim A = )°; a; deg ®;, with correspond- 
ing sums for the dimensions of B and T. Thus, we need only show a; < y; when 
the obstruction map on 7; is non-zero. If 6; = y; then, in light of Proposition 2.19, 
the span of B; and 7; must agree in H’. If rz, # 0 then the containment A; C B; 
must be strict by Theorem 2.7, implying a; < 6; = y;. On the other hand, if 6; < y; 
then there is nothing more to do. oO 


Remark 2.21 We can use the bound in Corollary 2.20 even when computing the 
obstructions with finite precision. Let r;,y to be 0 or deg ®; depending on whether 
m7, = 0 mod p . Clearly, ry <r; and dima, T —)°; ri,n also serves as an upper 
bound. 


The result of Corollary 2.20 is easy to use. However, we can improve these 
bounds by computing more. Define the following map on 7; 


Ti UK (x0), og" Frob(v),...,70(q” Froby) “| (v)) . (2.4.4) 


Observe that this map is @,,-linear and let L; = ker z;. The space L; is the largest 
Frob,-invariant subspace of ker zr7,. 


Proposition 2.22 Assuming the full Tate conjecture 
dimg A" (Xz) < > dima, L 
i 


Proof Let u; be the degree of ®;. We extend the map 7; to the span 7/ of T; in H’ 
as follows. On each element we use the same definition 
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mive (x1), m oq" Froby(v),...,7 0 (q7" Froby) “| (v)) (2.45) 


However, we modify the Q(k’)-action on the codomain so that Q(k’) acts go! -linearly 
on the i-th coordinate. As a consequence, mT is Q(k’)-linear. Therefore, the kernel 
of 7 is the span L’ of L; in H’. 

Since A; C B; is in the kernel of z and A; is invariant under Frobenius, A; is 
contained in the kernel of 1}. Thus, the Q(k’)-span of A; lies in Es Since elements 
of B; are Q(k’)-linearly independent, the same holds for A;. We conclude that 
dimg Aj < dimo’) Le = dima, Lj. Oo 


2.5 Bounds on the Characteristic Polynomial of the Frobenius 


Let Xp denote the scheme Xp x, k for an algebraic closure k of k. Let g = p” be 
the number of elements in k. The Weil conjectures (now a theorem [Del74]) tell us 
that the Hasse—Weil zeta function of Xy has the form 


2 #YX. (Fim) 2dim(Xp) . 
Z(Xp,t) := exp (>: Mee = [] aa.o, (2.5.1) 
m=1 i=0 


where P;(Xp,t) is the reciprocated characteristic polynomial of the Frobenius 
action on H¢,y.(Xp/Zp). 


P;(X, t) := det(1 — t Froby) € 1+ 1¢ZIt]. (2.5.2) 


The polynomial P;(Xy, t) has integer coefficients, has constant term 1, and all of its 
roots in © have Euclidean norm q~‘/?. This information is crucial in deducing P; 
from an approximate representation of the Frobenius (see Sect. 4.1.5). 

Note that the x2,, defined in Sect. 2.3, and P, are related by 


pPete ys, (lt = Pa (XG) 253) 


Remark 2.23 The Tate conjecture implies that the number of real roots of 
Po,(Xp Xx kK’, g7"t) € QJ[t] for a sufficiently large (but finite) extension k’/k 
coincides with dim A’ (X,). Thus, we expect the parity of dim A”(Xy) to match the 
parity of dim H2!,.(Xp/Zp) = deg Po,. 


crys 
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3 Computing in Crystalline Cohomology 


Computations in crystalline cohomology are made possible by comparing it with 
two other cohomology theories. Berthelot’s comparison theorem relates crystalline 
cohomology to a characteristic 0 de Rham cohomology. Monsky—Washnitzer coho- 
mology, on the other hand, allows one to represent the Frobenius map explicitly. In 
this section, we outline this construction. 

The de Rham cohomology is amenable to computation in general. For hyper- 
surfaces in particular, Griffiths’ explicit description of a basis makes these com- 
putations highly practical, see Sect.3.4. The approximations of the action of the 
Frobenius are made in terms of the this basis. 

This approach to approximating the Frobenius action was first conceptualized 
by Kedlaya [Ked08], first implemented by Abbott, Kedlaya and Roe [AKR10], and 
shown to be practical in larger characteristic by Costa and Harvey [Har10a, Har10b, 
Har10c, Cos15]. 

We summarize the required statements from [AKR10] and re-frame some using 
Zy coefficients as opposed to Q,, coefficients (e.g. Proposition 3.8). 


3.1 Crystalline to de Rham Cohomology 


The de Rham cohomology of an affine variety does not behave well in finite 
characteristic. On the other hand, in characteristic zero, the cohomology of a 
hypersurface is made explicit by working with the (affine) complement of the 
hypersurface. The de Rham cohomology of a log pair is what is needed to carry 
the characteristic zero advantages to positive characteristic. 

Let Y be a smooth proper variety over Zp and let X — Y be a subvariety that is 
a relative normal crossing divisor. Such a pair (Y, X) is called a smooth proper pair 
over Zp [AKR10, Section 2.2]. Denote the complement of X in Y by U := Y \ X. In 


p2r+ 
Z 


' and X a smooth hypersurface. 


practice, we will take Y = 


The de Rham cohomology, as well as the crystalline cohomology, of a smooth 
proper pair (Y, X) is well defined [Shi02, Chapter 2]. The cohomology of the 
hypersurface complement U, at least over the generic fiber, can be computed via 
the pair (Y, X). Indeed, there exists a natural isomorphism [Kat89, Theorem 6.4]: 


Hig (YY, X)/Zp) @ Qy = Hig (Ua, /Q,). (3.1.1) 


We may compare the de Rham and crystalline cohomologies of a smooth proper 
pair (Y, X) by the following generalization of Berthelot’s comparison theorem. We 
denote by Yp and Xy the special fibers of Y and X over Zp. 


Theorem 3.1 (Berthelot Comparison Theorem [Ber97, Shi02]) For each i > 0, 
there is a canonical isomorphism 
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Hig ((Y, X)/Zp) ~ Hiy.(Yp, Xp). 


The functoriality of crystalline cohomology thus equips the de Rham cohomology of 
(Y, X) with a Frobenius action. 


Let us remark that if X = @ then Hi, ((Y, X)/Zp) is denoted by H'j, (Y/Zp) as 
it coincides with the usual de Rham cohomology of Y. The analogous notational 
convention holds for the crystalline cohomology. 


3.2 Splitting the Cohomology of a Hypersurface 


We will first recall the characteristic zero statement regarding the splitting of the 
cohomology of a smooth hypersurface. Then, we will give the equivalent statement 
over Zp. 


3.2.1 Splitting in Characteristic Zero 


Let K be a characteristic zero field and X C Per a smooth hypersurface with 
complement U. By the Lefschetz hyperplane theorem and Poincaré duality, the only 
non-trivial cohomology group of X is in degree n. Moreover, there is a natural 
splitting of the cohomology of X which is orthogonal with respect to the cup 
product: 


Hig(X/K) = Hig? (Pat! /K) © Hit! (U/K). (3.2.1) 


The projection onto the first factor, the cohomology of PY ' has the following 
geometric interpretation. Thinking of the underlying complex analytic manifolds, 
there is a natural map from the singular homology of Re to the homology of 


Xq obtained by intersecting a homology class in P”*! with X. The dual of this 
map gives the projection onto the first factor above. The kernel of this projection 
is called the primitive part of the cohomology and is canonically identified with 
Hii! (U). Note that if n is odd then H?*°(P1*) is zero. 


3.2.2 Splitting Over the Ring of Witt Vectors 


We will now work over the Witt vectors Zy of the residue field k of characteristic p. 
We take X Pot to be a smooth hypersurface of even dimension, the odd case 
being much simpler and of less interest for our purposes. Recall that the de Rham 
cohomology groups of Pat X, and ( Pm X) admit canonical Frobenius actions 
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via Theorem 3.1. The goal of this section is to state and prove Proposition 3.2, which 
is not explicitly stated in [AKR10] but follows from the arguments presented there. 

To put Proposition 3.2 in context, let us recall some of the simpler results. 
From [AKR10, Corollary 3.1.4] we have 


Zy t = {0,2,..., 2n} 
Hi,(P% =}? Sal ies Vn > 0. 32,2 

ar( Zy) {6 otherwise, i= ( ) 
Furthermore, by arguing as in [Pan20, Lemma 2.4] or [DI87], we see that the 
cohomology groups Hijp(X/Zp) are Zpy-torsion-free for all i. Comparing with 
characteristic zero, we conclude that: 


Zp i € {0,2,..., 2m} \ {2r} 


Hig (x) = {6 (3.2.3) 


otherwise. 


Proposition 3.2 Suppose the degree of X is not divisible by p. Then, there is a 
natural Frobenius equivariant splitting of the cohomology of X: 


Hap (X/Zp) ~ Hg? (PZ*'/Zp)() ® Hyg! (P77! X)/Zp)(1). 


Proof Let Y = Po There is a long exact sequence associated to proper 


pairs [AKR10, Proposition 2.2.8]. Combining with (3.2.2) and ignoring the Frobe- 
nius action we get the following exact sequence: 


0 —> Ht (YY, X)/Zp) —> H3R(X/Zp) => W254? (Y/Zp). (3.2.4) 
Corresponding to the inclusion 1: X — Y we have pullback maps on cohomology 
> HR Y/Zp) > Hig (X/Zp). (3.2.5) 


Let h be the first Chern class of the line bundle Oy(1) and h* its k-th self product. 
It is easy to see that ¢ 0 15, (h") = deg(X)h'*!. Since powers of h generate the 
cohomology of ¥, and because p does not divide the degree of X, the composition 
¢ 045, is an isomorphism. In particular, the last arrow in the sequence (3.2.4) is a 
surjection and the sequence splits. 

To make the splitting equivariant with respect to the action of the Frobenius 
we must twist the two components. Recall that the cohomology of X is torsion- 
free. Thus each component injects into its tensor with Q,,. Now use the discussion 
following Definition 2.3.3 and Proposition 2.4.1 itself in [AKR10] to conclude the 
proof. oO 
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3.3. Torsion-Free Obstruction Space 


Let Pas /Zy) denote the (decreasing) Hodge filtration on the cohomology of 
the smooth hypersurface X c P?/*!. 


Proposition 3.3 For any a, b > 0 the quotient 
FH gg (X/Zp)/Fo* Higa (X/Zp) (3.3.1) 


is Zy-torsion-free. 


Proof The Hodge to de Rham spectral sequence degenerates at E, [DI87, Ill194, 
Theorem 4.2.2]. Therefore, for any s > 0, the graded piece 


Gr, := FSH32 (X/Zp)/F° Hea (X/Zp) 632) 


is isomorphic to H7’~5(X, Qh, [Dy Because the Hodge numbers of a smooth 
hypersurface depends only on degree and dimension (and not on the base field), 
we conclude that the graded pieces, Gr;, are torsion-free. 

The Hodge filtration on cohomology descends to a filtration on (3.3.1) whose 
non-zero graded pieces coincide with the graded pieces, Gr,, of the cohomology. A 
filtered module with torsion-free graded pieces is torsion-free. Oo 


Recall the set-up from Sect.2. Let mr be the obstruction map (1.1.3) from 
T’ (Xp) and let 24 be the obstruction map from A’ (Xy). Proposition 3.3 implies 
the following result. 


Corollary 3.4 A Tate class v € T’(Xy) is unobstructed if and only if a non-zero 
integral multiple of v is unobstructed. In symbols: 


mr(v) =0 = > (dn € Zo, mr(nv) = 0). 


The equivalent formulation using 14 also holds. Oo 


By itself, Corollary 3.4 does not imply that the cokernel of the inclusion map 
A’ (X) — A"(X,) is Z-torsion-free. This is because the theorem of Bloch—Esnault— 
Kerz (Theorem 2.7) works with rational coefficients and thus implies only that a 
multiple of an obstructed class in A’ (Xp) can formally lift. 

Nevertheless, for divisors on surfaces (r = 1), it is known that the cokernel of 
Al(X) > Al(x p) is torsion-free. This follows from the much stronger [Ray79, 
Theorem 4.1.2.1] or its generalization [EJ11b, Theorem 1.4]. 
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3.4 Primitive Cohomology After Griffiths 


In this section we will describe Griffiths’ basis for the primitive cohomology 
Aprim C Hog (C/Zp) of a smooth hypersurface X C Pa (Definition 3.9). The 
main result leading up to this definition is Proposition 3.8. 

Let P denote Py throughout this section and let d be the degree of X. 


Recall that Hprim is naturally isomorphic to Higg := Hin ‘, X)/Zy). When 
tensored with @,, this logarithmic cohomology coincides with the cohomology of 
the generic fiber U of the complement U := P \ X of our hypersurface X. That is, 


Hog @Zy, Qy X Hi! (U/Q,). (3.4.1) 


The (n + 1)-th de Rham cohomology of the affine variety U is readily computed. 
Consider the following map, induced by exterior differentiation on global sections, 


F :=H(P, Q/z, (a X)) 4. ¢6:=P@, Qhz, ((n + 1)<X)) (3.4.2) 


We will denote the quotient G/dF by I. Using characteristic 0 arguments [Voi07, 
§6.1], we know l' @z,, Qp ow Hip (U/Qy»). In light of (3.4.1), we will compare Hiog 
with T. 

We have a filtration G* on G induced by the pole order of forms: 


Gain (He, amie (n+ 1 = j) 0) > HCP, QB, (n+ 1)- x))) 
(3.4.3) 
The induced filtration on I will be denoted by T°, so that F/ = im (G/ > i), 

Let R = Zpy[x0,...,%n41] and f € R be a polynomial defining X. Let J = 
(do f,---, On+1f) in R be the Jacobian ideal of f, where 0; denotes differentiation 
by x;. We will write S for the quotient R/J. Homogeneous components of R, S, J 
will be denoted by subscripts. Let N; := (n +1— j)d —n—2 for j =0,...,n. 
We have a natural isomorphism 


— __volp, (3.4.4) 


here volp is a natural generator of QD, (n + 2) given by 


n+l 
volp = } (=D)! xidxg A+++ A dai A+++ A dxnqi. (3.4.5) 
i=0 


Lemma 3.5 If p > n+ 1 and p { d then the map Ryn; > T/ induces an 
isomorphism 
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ize q j+il 
(R/Dy, > Ire, 


Proof This is standard in characteristic 0 [Voi07, Gri69]. The image of F — G is 
generated by elements of the form 


(= = 20) volp and (*# om me Er) voln (3.4.6) 


eachm = 1,...,n+landg € Ry,,,_,,- Since p{d, f € J andsince p>n+1 
we can divide by m in the relations above. It follows that, arguing as in [Gri69, $4], 
the pole order of an element in G can be reduced modulo dF if and only if it is in 
the image of J. oO 


Lemma 3.6 If p { d then R/J, and in particular (R/J)y for each N > 0, is 
Zy-torsion-free. 


Proof Let K stand for either the residue field or of the function field of Zy. In either 
case, with S = R/J, S @z, K is a local complete intersection ring of dimension 
zero. Since J ®@ K is generated by n + 2 elements forming a regular sequence each 
of degree d — 1, the Hilbert series of S ® K is independent of K. We used the 
smoothness of X here and that f € J when p { d. Since dimgx S® K is independent 
of K, S is torsion-free. oO 


Lemma 3.7 [f p { d and p > n+ 1 thenT is torsion-free. 
Proof Lemmas 3.5 and 3.6 imply that the graded pieces of Fare torsion-free. O 


In light of the three lemmas above, we conclude that I injects into Pr @z, Qy, 
which in turn is isomorphic to Higg ® Qp and Aprim @ Qy. Recall that the de Rham 
cohomology of X is torsion-free and by Proposition 3.2 so is Hiog. Thus we have 
two injections: 


> Aprim @ Qy << Alog, ifp{d, p>n+t+l. (3.4.7) 


Proposition 3.8 If p { d and p > n+ | then the images of V and Hig in (3.4.7) 
coincide. In particular, T is isomorphic to Hog. 


Proof This is now a consequence of Remark 3.4.4 and Corollary 3.4.7 of [AKR10]. 
oO 


Impose the grevlex monomial ordering on R and on R @k ~ Fg[xo, ..., Xn41]. 
As aresult, there is a natural ordered monomial k-basis B” for the quotient R/J @k. 
Let B’ C R be monomials that map to B’. By Nakayama’s lemma, B’ is a Zp- 
basis for the free-module R/J. We will write B C Hprim for the image of Bn 
D'i-o (R/J) Nj with respect to the following composition of maps: 


n 


QD (R/Dy, > Aprim. (3.4.8) 
j=0 
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Definition 3.9 When p { d and p > n+ 1 then B C HAprim Will be called the 
Griffiths basis for the primitive cohomology of X. 


A particularly potent feature of the Griffiths basis is that it “respects the Hodge 
filtration” in the following sense. Write B = (w j,..., ws) in increasing grevlex 
ordering. If F* Hprim is the Hodge filtration on the primitive cohomology then 


Vf SOc ty Jig, Si Bp = Zp iy see e,) (3.4.9) 


This follows from the fact that pole order filtration on  @ ,, coincides with the 
Hodge filtrations on Hjp(U/Q,) and on Aprim @ Qy [Voi07]. 


3.4.1 Approximating the Frobenius Matrix in Terms of the Griffiths Basis 


We comment briefly on how to compute an approximation of Frobenius action 
Frob, on the primitive cohomology Hprim of a hypersurface X. We will focus on 
the even dimensional case, but the odd dimensional case requires minimal change. 
We recommend [AKR10], [Cos15] or [CHK19] for a more detailed discussion. We 
assume p { d and p > n+ | throughout. 

Recall U = P\ X, U = U®@ Q, and that the de Rham cohomology Hy := 
Ha : (U/ Q,) of U is identified with Hprim ® Q,. Note also that Aprim is torsion- 
free and the natural inclusion 


Hprim(—1) > Hprim ® Q, > Hu (3.4.10) 


is equivariant with respect to the Frobenius action [AKR10, Proposition 2.4.1]. 

In the previous section, we represented elements of Hprim and of Hy using 
polynomials. We will work with the Griffiths basis B C Hprim. Let xP. Bs | 
Zy[Xo, -.-,Xn41] be the monomials that map to B. The image of B in Hy is given 
by (3.4.4). Namely, with J; = (deg (x47) +n+ 2) /d the following elements give 
the corresponding basis in Hy: 


Bi 
= = volp modG, i=1,...,s. (3.4.11) 


Here, one switches to another cohomology theory. Since U is affine, Hy can be 
computed using the Monsky—Washnitzer cohomology. The advantage of Monsky— 
Washnitzer cohomology in this context is that it provides great flexibility in how 
one can choose to represent the Frobenius matrix in the chain of forms that compute 
the cohomology [AKR10, Definition 2.4.2]. 

The Frobenius action can be described on nj; and Griffiths-Dwork reduction 
allows one to re-cast Frob,(7;) in terms of the Griffiths basis again. The catch is 
that Frob,(7;) becomes an infinite sum, each term having coefficients with higher 
and higher valuation. Truncating the infinite sequence and applying the Griffiths— 


316 E. Costa and E. C. Sertéz 


Dwork reduction gives an approximation of Frob,(7;) in the Griffiths basis. The 
degree of truncation required to attain the desired precision is discussed in [AKR10, 
§3.4]. 


4 Main Algorithm 


This section states and explains the steps of the main algorithm of this paper, 
Algorithm 4.1. The version here provides an upper bound for the geometric middle 
Picard number of a given smooth hypersurface. A simple variation gives bounds 
on the geometric Picard number of a Jacobian, see Sect. 5.1. More sophisticated 
variations allow for the study of non-degenerate hypersurfaces in simplicial toric 
varieties [CHK 19]. 

The algorithm takes the equation f € K[xo,..., X2-41] of a smooth hypersur- 
face X C PE tl as input. The output is an upper bound for the geometric middle 
Picard number, p’(X x x K), of X. Optionally, one may choose to specify a lower 
bound for the precision N used to approximate the Frobenius map. For small N, the 
upper bound may improve as N is increased. The bound will eventually stabilize. 
One may also provide a lower bound on the characteristic p of the prime p C Ox 
to be used for the good reduction of X. 


4.1 Clarification of the Steps in the Algorithm 
4.1.1 Pick a Good Prime 


Pick the first prime number p exceeding 2r + 6 and char_bound. Choose an 
unramified prime p of K lying above p. By clearing the denominators of the 
polynomial f defining X, create a model X/Zy of X. Check if the model X/Zp 
is smooth. If not, pick another prime and repeat. 

It is possible to eliminate the simple but haphazard searching method above: 
Compute the discriminant of X over Z and avoid the primes dividing this discrimi- 
nant. However, discriminants tend to be huge. 


4.1.2 Compute a Griffiths Basis for Primitive Cohomology 


In Sect. 3.4 we described how to compute a Griffiths basis for the primitive part 
of HAR (X/ Zy). Choosing the polarization to complete the basis, we represent the 
arithmetic Frobenius map Frob, on the cohomology Hee Ch /Zp) = Hey p/Zp) 
via the Berthelot comparison theorem, see Theorem 3.1. This square matrix with 
Zy entries will be denoted by [Frob,]. 
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Input : f € Ox[x0, x1,..., 2-41]; Optional: precision_bound, char_bound € N 

Output: An upper bound for the geometric middle Picard number of X = Z(/). 

function BoundRank ( f; precision_bound = 1, char_bound = 3) 

// Pick a good prime, with residue characteristic at least char_bound, Sect. 4.1.1 

p:=next_good_prime (f, max (2r + 6, char_bound) ) 

// Compute a Griffiths basis for primitive cohomology, Sect. 4.1.2 

B:=griffiths basis (fp) 

// Compute a minimal working precision, Sect. 4.1.3 

gq := #k,m:= #B+ 1 

N := max (precision_bound, log, (2(;,2oq)a"21)) // See Eq. (4.1.3) 

// Compute an N-digit approximation of the Frobenius matrix, Sect. 4.1.4 

[Froby,v] := frobenius (f p, N, B) 

// Compute the characteristic polynomial of the Frobenius matrix, Sect. 4.1.5 

x(q" Froby) := weil characteristic _polynomial(Frob,y, N) 

// Represent the obstruction map, Sect. 4.1.6 

[wx] := projection _map(B) //mx: H3e (X) > HR (X)/P" HA (X) 

// Loop over the cyclotomic factors of x (q~" Frob,), Sect. 4.1.7 

bound := 0 

for oy € cyclotomic _factors(x(q~" Frob,)) do 

// Approximate the space of Tate classes, Sect. 4.1.8 

Br, y ‘=tate_basis_ matrix (;, Frob,,, N) // columns form a basis 
for Tj,v 

// Approximate the map 77, Sect. 4.1.9 

(7i,n] := [ax]: column_matrix (1, [Frobky],---, [Frob,,  ]*°2 2-1) ‘Br, y 

// Increment by the bound on the dimension of L; := ker 7;, Sect. 4.1.10 

bound := bound + deg oy bound_rank([zj,n]) 


Ee return bound. 
Algorithm 4.1: Computing an upper bound for the geometric middle Picard 
number 


4.1.3. Compute a Minimal Working Precision 


The dimension of the middle cohomology of a hypersurface of dimension 27 and 
degree d is given by the formula 


(d= 1)7** 4 2d —1 
m i= d F 


(4.1.1) 


In any case, m = #B+1 where B is the Griffiths basis for the primitive cohomology. 
The Weil conjectures imply that the reciprocal characteristic polynomial of Frob, 
on HaR(X/ Zy) has integer coefficients with constant term equal to 1, i.e., 


Po, (X, t) = det(1 — t Froby) € 1 + ¢Z{t]. (4.1.2) 


Moreover, this polynomial is completely determined by the coefficients of t/ with 
i =0,..., [m/2], and these coefficients have absolute value at most 
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eae (4.1.3) 


Thus if pY exceeds twice this bound, then P2,(X, t) is determined by its reduction 
modulo p’. The bound above might be significantly improved by employing 
Newton identities, see [Ked13, slide 8]. If the precision requested by the user is 
not sufficient to lift P2,(X, t), we increase it accordingly. 


4.1.4 Compute an N-digit Approximation of the Frobenius Matrix 


With N as above, we may compute a matrix [Frob,,,] approximating the matrix 
[Frob,] to N p-adic digits as explained in [AKRI10], [Cosl5] or [CHK19]. 
We sketched the idea in Sect.3.4.1. In practice, one may use the library 
controlledreduction.” 


4.1.5 Compute the Characteristic Polynomial of the Frobenius Matrix 


As explained above we may deduce P2, from det(1 — t Frob,,y). In practice, we 
compute x (Froby y)(t) = x (Frob,)(t) mod p and lift each coefficient to the 
interval [—p /2, p% /2]. By the discussion above, the representatives of the last 
coefficients of t/ fori = [m/2],...,m match exactly their lifts, and the remaining 
coefficients are deduced using the functional equation 


xX (Frob,)(t) = +(q~"t)” x (Froby)(q*" /t). (4.1.4) 


The sign of the functional equation is the sign of det(Frob;), which we are able to 
compute to one significant p-adic digit. 


4.1.6 Represent the Obstruction Map 


The obstruction map my (see Sect.2.1.6) annihilates the polarization. Thus, it 
remains to describe 2 on the primitive cohomology. Since the Griffiths basis on 
the primitive cohomology respects the filtration (see Sect. 3.4), the map mx on the 
primitive cohomology in the Griffiths basis is just the projection onto the last few 
coordinates. Let [7] be the matrix representation of this projection. 


?This library is made available in SageMath [Sag20] through the wrapper https://github.com/ 
edgarcosta/pycontrolledreduction. 
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4.1.7 Extract Cyclotomic Factors from the Characteristic Polynomial 


Factorize the characteristic polynomial over Q([f] as in (2.3.1). 


4.1.8 Approximate the Space of Tate Classes 


Let ;(t) be a cyclotomic factor computed in the previous step, let By, be a 
basis for the eigenspace Tj, := ker (0; fa Froby. v )). This 7; y approximates 
T/ (Xp) := ker (®;(q~" Frob, )) from Proposition 2.16. 

The kernels are computed using standard algorithms that keep track of the p-adic 
precision, see [CRV18]. The computations equip 7;, with a basis which we record 
in the columns of a matrix Br, ,. 


4.1.9 Approximate the Map z; 


Restricting zx on to 7; , we obtain the approximation 
mr, y: Tin > HiR(X)/FH3R (X) (4.1.5) 


of the obstruction map 7, on T/ (Xp). Therefore, the matrix representation [77, , | 
of 7, y is then equal to [7] - By, ,. Similarly, we may approximate the map 


r 2r rpy2r ae 
en ae (H3RCO/F Hiafe(2)) 
vt> (r(v), 7 0g * Froby(v),..., 70 (q" Frob,)*2 ®'—!(y)), 
(4.1.6) 
from Sect. 2.4, is approximated by 
1 
q "[Froby,y] 
[zn] = [1x] * . ; Br, y- (4.1.7) 


(q~" [Froby, v8?! 


4.1.10 Bound Dimension of L; 


The approximation 77, of 2; allows for the computation of a lower bound b;,y on 
the rank of 7;, see for example [AKR10, Algorithm 1.62]. Although we will not 
know when this happens, if N is large enough, then the rank b;,y of ;,y will match 
the rank of z;. Nonetheless, as L; := ker7;, we have 


dim L; = dim T; — rk; < dim T; — bj,n. (4.1.8) 


320 E. Costa and E. C. Sertéz 


4.1.11 Return the Upper Bound on the Middle Picard Rank 


From the previous argument at the end of the for loop we have )0; dima, Li < 
bound. Combining this with Proposition 2.22 we obtain the sought inequality: 


dimg A’ (Xz) < dima, Li < bound. (4.1.9) 
i 


Remark 4.1 Let us warn once again that, even in the favorable conditions provided 
by K3 surfaces, the inequality (4.1.9) can be sharp. See Example 5.8 for a 
demonstration. 


5 Examples 


We now give explicit illustrations of the methods developed in this paper. We have 
implemented a version of Algorithm 4.1, called crystall ine obstruction,’ 
where the prime is also given as input. We will show its basic usage below. 

There are three sets of examples below. In the first set, we work with Jacobians 
of curves. The Hodge structure on their cohomology is inherited from the Hodge 
structure of the corresponding curves. The advantage is that the dimension of the 
cohomology is small enough that we can write the Frobenius matrices explicitly. 

In the second set, we illustrate the basic usage on surfaces in projective space. 
We checked all of the Picard numbers in the quartic database* of [LS19]. In every 
case, our upper bounds agreed with the numbers listed there. We also comment on 
the performance gain in using the obstruction method. 

In the third set, we give pathological examples. For example, the obstruction 
space associated to a quintic surface is four dimensional. We give an example where 
the image of the obstructed Tate classes span a one dimensional space although four 
dimensions of algebraic classes must be obstructed. 

Our convention for writing a p-adic number modulo p% is as follows: given 
a € Q, we write a = p” -b mod p% where m € Z, b € Zso, pt b. 


5.1 Jacobians of Plane Curves 


The discussion on hypersurfaces above allows us to compute the Hodge decom- 
position on the first cohomology of a smooth curve. Since the cohomology of the 


3For its implementation in SageMath [Sag20], see https://github.com/edgarcosta/ 
crystalline_obstruction. 


4One may view the database at https://pierre.lairez.fr/quarticdb/. 
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Jacobian is isomorphic to the natural Hodge structure on the wedge powers of the 
first cohomology of the curve, we can treat Jacobians explicitly. We can thus use 
Algorithm 4.1 with minor modifications, as illustrated below. The purpose of this 
change in context is to be able to display complete examples in print. 


Example 5.1 We start with a genus 2 curve 
C/Q: y? — (4x° — 36x4 + 56x? — 76x? + 44x — 23) = 0. (5.1.1) 


Choose the prime p = (31) and write f for the equation of C. This curve has 
LMEFDB label 1104.a.17664.1. Let J denote the Jacobian of C. We will compute the 
geometric Picard number of J. 

When given a curve, the code crystalline obstruction understands 
that the intention is to compute with the Jacobian of the curve. The command is 
simple: 


sage: from crystalline obstruction import crystalline 
_obstruction 
sage: crystalline obstruction(f, p=31, precision=3) 


The first entry of the output is the upper bound (in this case sharp) on the geometric 
Picard number of J, and the second is a dictionary recording relevant intermediate 
results. The entire computation for this example takes less than a second giving the 
output: 


(1, {' precision’ : 3, ‘p’: 31, ‘rank T(X_Fpbar)’: 2, 

‘factors’: [(t - 1, 2)], 

‘dim Ti’: [2], ‘dim Li’: [1]}) 

We will now walk through the intermediate steps of the computation. We first 
check that J has good reduction over p. Let C and J denote the natural models 
of C and J over Zy, respectively. As discussed in Sect.3, we may compute an 
approximation of Frob, on Hip (C/Zp) @z, Q, = Hip (J/Zp) ®z, QQp and its 
characteristic polynomial: 


31-482 31-284 16241 3075 
31-386 31-886 2644 12126 ; 

Frob = d 31), 

TO hate (/Q,) = | 31.284 31-659 6336 9750 | “Od 31? (5.1.2) 
31 - 194 31 - 876 27408 10841 


x =1 — 3t + 140? — 9327 + 96124. 


The natural Hodge structures on both sides of the isomorphism Hip (J /Qp) = 
AHIR /{Qp) agree with one another. Thus the wedge products of the Griffiths 


basis we used for C will reveal the Hodge structure on higher cohomologies of J. 
From x and the approximate Frobenius above we can deduce 
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det(t — Frob|y2_ (gy) = @ — D714 + 482° + 4317 + 481 + 31)/31, 


‘crys 


312-19 31 - 660 31-776 31 - 843 31 - 506 22499 
312-18 31-250 31-459 31-270 31 - 683 10699 
312-3 31-154 31-636 31-261 312-24 3010 


= 3 
Frob |13.(/Q,) = | 312.29 31-557 31-664 31-392 312-23 10438 | “med 31). 
312.30 31-77 31-516 312-26 31-449 3650 
312.7 31-668 31-509 31-277 31-513 17591 
(5.13) 


and T! (Jp) = Span(v1, v2), where, having lost a digit of precision, we have 


v} = (356, 37, 831, 0, 295, 31) (mod 317) 


(5.1.4) 
v2 = (4, 957, 3, 1, 0, 0) (mod 31’). 
The Hodge numbers for Hae (J/Q,) are (1, 4, 1), and the projection 
mg: Hag (J/Qp) > Har (J/Q,)/F' Hag J/Qy) ~ Qy (5.1.5) 


corresponds to the projection onto the last coordinate in the basis chosen above. 
This gives 


mr = (310) (mod 31’) (5.1.6) 


in the basis v1, v2 for Tg p)- Thus, the rank of the obstruction map zrr is at least 
1. This is the first entry in the output of our code. 

We also know that the polarization lifts and thus dima A! V@Q -_ 
rk End(J@) = 1. 


Example 5.2 Let us now look at an example where the rank of the Jacobian is 
known to be 2. In this case, an upper bound of 2 will not prove that the rank is 
indeed 2. However, our numerical methods has the advantage that if the rank was 
1 then the obstruction map would be non-zero at high enough precision. Observing 
that the obstruction is zero to higher and higher precision would be compelling 
evidence that the rank is 2. 

We pick the genus 2 curve C: y? = x° —2x4+7x? — 5x? +8x +3 over Q with 
LMEFDB label 30976.a.495616.1. Mutatis mutandis, we may repeat Example 5.1, for 
p # (2), (11). However, the Jacobian J of C will now have real multiplication by 
Q(V17), see [CMSV19]. Therefore, dima A! (Jeg) = 2. 

Working with finite precision, it is impossible conclude that the obstruction map 
is identically zero. We observe however that even over a large prime such as p = 
4999 the obstruction map on the two-dimensional T! (J/ Q,) is zero modulo ge, 
Even with these large numbers, the computation took about 3 min. 
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Example 5.3 In this example, we will show how one can obstruct more than one 
linearly independent cycle at a single prime. Unlike [EJ11b], this method does not 
require an investigation of the geometry of algebraic cycles. 

Consider the following genus 3 plane curve, its defining equation will be denoted 


by f: 
C/Q: xy? + x3z — xy?z + x727 + y?2? — yz? =0. (5.1.7) 


The following command takes less than a second to return the answer, which 
shows that dim A! (J) = 1 where J/Q is the Jacobian of C: 


sage: crystalline obstruction(f, p=31, precision=3) 
(1, {'’precision’ : 3, ‘'p’: 31, ‘rank T(X_Fpbar)’: 3, 
‘factors’: [(t - 1, 3)], 
‘dim Ti’: [3], ‘dim Li’: [1]}) 
With p = (31) and C/Zy, J/Zp as in Example 5.1, we demonstrate the intermediate 
steps: 


xX =1 + 782? + 40827 + 241814 + 76882 + 297911 


31-104 31-218 31-7 27783 2569 7195 

31-351 31-494 31-690 19524 8323 1421 

Frob [ys = 31-50 31 - 237 31 - 829 13467 20050 19610 
ar(C/Qyp) 31-19 31 - 733 31 - 377 20592 23805 15085 

31 - 482 31-610 31 - 793 12397 28951 6604 

31-710 31 - 860 31 - 689 19294 18382 25376 


(mod 31°). 


Now the Hodge numbers for Hi, J / Q,) are (3, 9, 3) and we have 


det(t — Frob ae 1)3(313 + 15 - 31?t — 642 - 3117 
e ( — Fro IH2.ys(Jp)()) = ap" = ) ( + . t-— . t 
+ 145451? + 3606014 + 180637° + 3172r° + 1806377 +--+). (5.1.8) 


We find that the space of Tate classes, Tg / Qy) = ker (Frob, —1), is three- 
dimensional. The obstruction map zr is approximated by the following matrix with 
respect to a basis: 


31-240 031-1 


mr =} 31-515 31-1 0] (mod 31°). (5.1.9) 
0 0 0 


Thus, rk 27 > 2 and we conclude dim A! (JQ) = dim End(Jq) =1. 
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Example 5.4 We may also use our method to directly bound the dimension of the 
geometric endomorphism algebra of an abelian variety. We do this via divisorial 
correspondences. 

If X and Y are two projective smooth varieties over a number field K, then 


NS(X x Y) = NS(X) @ NS(Y) © DC(X, Y) 


(5.1.10) 
C Hig (X/K) ® Hig (¥/K) © Hag (X/K) ® Hig (¥/K), 
where NS denotes the Néron—Severi group and DC denotes divisorial correspon- 
dences. The Néron-Severi group NS coincides with our A!. 
Furthermore, by [Lan83, VI §2 Thm 2] we have the following isomorphism 


DC(X, Y) ~ Hom(Alb(X), B(Y)), (5.1.11) 


where Alb(X) is the Albanese variety of X and §8(Y) is the Picard variety of Y. 

Note that the Hodge structure of the tensor Hig (X/Q p® Hig (Y¥/Q p) is induced 
by the Hodge structure of its two factors. Therefore, knowing the Hodge structure on 
the factors allows us to apply our method to bound the dimension of DC(Xq, Yo). 

For example, taking X = Y = C for acurve C/K, we can bound the dimension 
of the endomorphism algebra End(J@) of the Jacobian J of C. The method is 
analogous to the previous examples. We let J/Zy denote the natural model of J 
and replace Haat / Q,) with Hid /Qp) ® Hind / Q,) in the computations. 

Our code automates this procedure with the keyword argument tensor=True. 
For example, take the smooth plane curve 


C/Q: — yt +2374 2x77? — x23 =0. (5.1.12) 


The following commands show that dim End(Jq) < 9 and dim NS(VJ/Q@) < 5, 
respectively. 


sage:crystalline obstruction(f, p=43, precision=4, 

tensor=True) 

(9, {"’precision’: 4, 'p’: 43, ‘rank T(X_Fpbar)’: 18, 
‘factors’: [(t + 1, 4), (t - 1, 6), (t*2 +1, 4)], 
‘dim Ti’: [4, 6, 8], ‘dim Li’: [2, 3, 4]}) 

sage:crystalline obstruction(f, p=43, precision=4) 

(5, {"’precision’: 4, 'p’: 43, ‘rank T(X Fpbar)’: 7, 
‘factors’: [(t + 1, 2), (t - 1, 3), (t*2 +1, 1)], 
‘dim Ti’: [2, 3, 2], ‘dim Li’: [1, 2, 2]}) 
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5.2 Surfaces in Projective Space 


Quartic surfaces and quintic surfaces are the prime examples in this section. We 
present some successful examples and some examples that demonstrate inherent 
weaknesses of the method. 


5.2.1 K3 Surfaces 


We begin with quartic smooth surfaces in P*. The middle cohomology of a K3 
surface has dimension 22 with Hodge numbers (1, 20, 1). 


Example 5.5 We applied our Algorithm 4.1 to each of the 184,725 quartic surfaces 
in the quartic database? [LS19]. For each quartic, we found an upper bound that 
agreed with the Picard number listed there (and, as expected, could not find a smaller 
upper bound). 

We will now compare the performance improvement of using the obstruction 
map. We distinguish between three different methods: The Galois obstruction 
method applies Algorithm 4.1 thereby using the obstruction map on each of the 
Galois orbits separately as in Proposition 2.22. The vanilla obstruction method 
deviates from Algorithm 4.1 only in ignoring the Galois structure on the space of 
Tate classes, that is, the obstruction map is considered on the entire space of Tate 
classes. For each surface, we start with the prime 23 and move up to the next prime 
if the upper bound does not match the number listed in the database or if the prime 
is not a good prime. We skip the primes less than 23, as the computation to deduce 
the Hasse—Weil zeta for these requires more precision [Cos15, Section 1.6.2], and 
hence more time. 

In applying the van Luijk method [vLui07] to each surface, it would make sense 
to stop after two primes have been found where the upper bound is optimal, whereby 
the discriminants can be compared. This means it would require, on average, twice 
as many prime reductions as the vanilla obstruction method. Almost the entirety 
of the computation per prime is spent on computing the zeta function, a feature 
common to all three methods. Therefore, the van Luijk method would take roughly 
twice as long as the vanilla obstruction method. On Fig. 5.1, its slope would be half 
that of the vanilla obstruction—or about one fifth of the Galois obstruction method. 

The entire computation for the Galois obstruction method took about 10 months 
of CPU time. The vanilla obstruction method takes about 16 months. Arguing as 
above, we expect the van Luijk method to take about 32 months. The Figs. 5.1 
and 5.2 give more detailed information. 


5One may explore the database at https://pierre.lairez.fr/quarticdb/. 
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Fig. 5.1 The fraction of surfaces from the quartic database that needed a certain number of prime 
reductions until sharp upper bound is attained 
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Fig. 5.2 Fraction of surfaces from the quartic database against time needed until a sharp bound is 
attained via Galois obstruction method 


Example 5.6 In this example, we give a simple but artificial example which 
demonstrates that there is no global upper bound on how much precision must be 
used to reach the optimal bounds of the method. 

Fix a prime p and let A = p”™ with large m > 0. Consider the following two 
elements of the Dwork pencil 
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X/Q: xt+yt+2ttwt+rxyzw =0, (5.2.1) 
Y/Q: xt4 yt 244 wt=0. (5.2.2) 


The Fermat quartic Y has Picard number 20 and it is known that Y, has Picard 
number 20 when p = | mod 4. Clearly, the two models X and Y lifting X» = Yp 
are indistinguishable modulo p”. If X has Picard number 19, then the obstruction 
map will be zero modulo p” as it will not be able to make the distinction between 
X and Y. Although, when the p-adic precision is sufficiently large, the obstruction 
becomes non-zero. 


Example 5.7 Take p = (89) and consider the following quartic surface 
X/Q: yo—-xeztye+zw3+wt =0. (5.2.3) 
The characteristic polynomial of 897! Froby on He p/Qp) factors as 


(t—1)>-(t +1): (#4 + 1) - (89 + 18827 + 30324 +316r° +30328 + 188710 + gor!) /89. 
(5.2.4) 


Therefore, the geometric Picard number of X is at most 10. By computing an 
approximation of the obstruction map on the Tate classes, we would at most improve 
this bound by 1. Instead, we use Corollary 2.20 to drop the rank from 10 to 4. 

For instance, we observe that the obstruction is non-zero on the space 


Ty (Xp) := ker(Froby +p*) C Hoys(Xp/Zyp). (5.2.5) 


This already allows us to drop from 10 to 6, since deg(t* + 1) = 4. Analogously, we 
work through the other two cyclotomic factors, observing a non-zero obstruction in 
each case. As these two cyclotomic polynomials are linear, we drop from 6 to 4. 
All of this is automated. One may use crystalline obstruction to 
deduce the bound 4. The following computation takes less than 4 min of CPU time. 


sage: crystalline obstruction(f, p=89, precision=3) 
(4, 
{'precision’: 3, ‘'p’: 89, ‘rank T(X Fpbar)’: 10, 
‘factors’: [(t - 1, 1), (t + 1, 1), (t - 1, 4), 
(t*4 + 1, 1)], 
‘dim Ti’: [1, 1, 4, 4], ‘dim Li’: [1, 0, 3, 0]}) 


Example 5.8 Consider now the K3 surface 
X/Q: x4 + 2y* + 2yz? + 324 — 2x3 w — 2yw3 = 0. (5.2.6) 
At p = (43) the geometric Picard number of Xy is two, and one Tate class is 


obstructed, thus the geometric Picard number of X is one. 
Now take p = (67). The characteristic polynomial of 677! Froby factors as 
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Gai) 1 467 — 1718! 4 Bie = 43 = 716 4 or 
+ 1781° — 215¢7 + 5418 + 502? + 541!°...)/67, (5.2.7) 


thus the geometric Picard number of Xy is 4. 

As the polarization is unobstructed, there is only one irreducible space to 
consider, corresponding to (t+1)?. Therefore, in contrast with the previous example, 
we may only obstruct the liftability of one class, as the obstruction space is one 
dimensional. Hence, we cannot obtain a sharp upper bound for the geometric Picard 
number of X by studying its specialization at p = (67). 


Example 5.9 Now we look at a K3 surface with real multiplication. Consider the 
K3 surface given by desingularizing the following double cover of the projective 
plane. It is given in the weighted projective plane P(1, 1, 3) and is branched over 
six lines. 


X/Q: w= (-y?/8 + yz- 2°) (7x?/8 +5xz+ 72°) (2x? + 3xy + y’) : 
(5.2.8) 


This is the specialization of the family X°-” in [EJ14, Theorem 6.6] with t = 1. 
By loc. cit. it has geometric Picard number 16 and real multiplication by Q(/2). 
In particular, every reduction of X will overestimate its geometric Picard rank by a 
multiple of 2 [Chal4]. In fact, the reductions must have geometric Picard rank 18 
or 22 [EJ14, Corollary 4.12]. The van Luijk [vLui07] method fails in this case. We 
now show that the obstruction method will succeed. 

Take the prime p = (83). By counting points on the singular model, we deduce 
that the characteristic polynomial of 837! Froby can be factored as x; - x2 where 


y= ¢—1)¢4+ 0%, x2 = 7 + 1(83 — 15877 + 8324/83. (5.2.9) 


Here, x; corresponds to the action of the Frobenius on the known 16 algebraic cycles 
on X. The factor x2 corresponds to the action on their orthogonal complement (the 
transcendental part). 

In keeping with the notation of Proposition 2.16, let 7 (Xp) := ker (Frob? + p’). 
From the argument above and Proposition 2.19, we conclude that T (Xp) spans 
algebraic cycles in Xp which do not lift. In particular, by Theorem 2.7, the 
obstruction map must be non-zero on ce (Xp). 

Consequently, using sufficiently high precision, one can detect that the obstruc- 
tion map 7, is non-zero. Using Corollary 2.20 one would then conclude that the 
geometric Picard number of X is 16. We did not implement the computation of the 
Frobenius on a smooth variety given as the resolution of a singular hypersurface. 
Therefore, we cannot comment on the precision required. 
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5.2.2 Quintic Surfaces 


We now consider smooth quintic surfaces. For these surfaces the Hodge numbers are 
(4, 45, 4). Now the obstruction space is four dimensional, and thus favoring richer 
examples. 


Example 5.10 Consider the following quintic surface 
X/Q: 10x*y — 32° +3x4w + 3y4w — 10xw*t — 23x yz = 0. (5.2.10) 


We compute that X, has Picard number one at the prime p = (13). This means 
the Picard number of X is one. On the other hand, the reduction Xp at p = (23) 
has 5-dimensional space of Tate classes. Fortunately, the obstruction space is 4- 
dimensional and we see that z7 is indeed rank 4. Thus, this method allows for the 
correct determination of the Picard rank of X at a prime where the jump in Picard 
number is four. It took about | day of CPU time per prime to compute the Hasse— 
Weil zeta function and an approximation for the Frobenius matrix. 


Example 5.11 The following example suggests that we are not necessarily immune 
to symmetries in the transcendental lattice. Take the quintic surface 


X/Q: 3x4z + Oxy* + 9y223 +. + 5w? =0. (5.2.11) 


The geometric Picard number of Xp at p = (31), and therefore of X, is one. On 
the other hand, the dimension of the space of eventual Tate classes T! (Xp) of Xy at 
p = (29) is 5, and the characteristic polynomial of the Frobenius action on qT! (Xp) 
is (t — 31)°(t + 31)*. This time, the following result shows that there is another 
theoretical limit to the best possible, nee bound we can coupe at p = (29). 

We recall the decomposition T1(X p) = 7} (Xp) B i. (Xp) as in Proposi- 
tion 2.16. 


Proposition 5.12 The images of the obstruction maps mr, and m7, coincide. This 
image has dimension 1, although the codomain has dimension 4. 


Proof We use the action of the involution 1: y +» —y on the obstruction space and 
on the Tate classes. By inspecting the Griffiths basis, it is immediately seen that the 
(—1)-eigenspace of «* on the obstruction space is one dimensional. 

We claim that all primitive Tate classes are (—1)- Sees of c* on cohomol- 
ogy. To show this, we computed an app oeenanen of T! (Xp) and projected this 
approximation to the (—1)-eigenspace of *. The minimal valuation of the 4 x 4 
minors of a basis of the image is 0, even with 10 digits of approximation. This 
proves the claim. 

The obstruction map is 1* equivariant and the polarization is unobstructed. 
Therefore, the image of the Tate classes via 7; must be confined to a one 
dimensional space. We observe that the two ranks are at least 1 by computation. O 
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We note another behavior to this problem at another prime. At p = (23), 
the characteristic polynomial of the Frobenius action on Py) is now (t — 
23)7(t + 23)(t? + 237). Since the Galois representation Tix p) Splits into more 
invariant pieces, and one piece is two dimensional, we can drop the rank to | using 
Corollary 2.20. It takes about 1 CPU day per prime to get these results. 


Remark 5.13 By the computations at p = (23), (29) and (31), we know that the 
dimension of the endomorphism algebra of the transcendental lattice is at most 4. 
Furthermore, we note that X has another automorphism given w +> ¢5w, thus the 
transcendental lattice has CM by (és). Then, one expects the Picard number of the 
reductions to be congruent to 1 modulo 4 [Chal 4]. 
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Conjecture: 100% of Elliptic Surfaces M®) 
Over Q have Rank Zero Becks 


Alex Cowan 


Abstract Based on an equation for the rank of an elliptic surface over Q which 
appears in the work of Nagao, Rosen, and Silverman, we conjecture that 100% 
of elliptic surfaces have rank 0 when ordered by the size of the coefficients of 
their Weierstrass equations, and present a probabilistic heuristic to justify this 
conjecture. We then discuss how it would follow from either understanding of 
certain L-functions, or from understanding of the local behaviour of the surfaces. 
Finally, we make a conjecture about ranks of elliptic surfaces over finite fields, and 
highlight some experimental evidence supporting it. 


1 Introduction 


Let € be an elliptic surface over Q. Fix a Weierstrass equation 

E:y?=x°+ A(T)x + B(T) 
with A(T), B(T) € Z[T] and 4A(T)? + 27B(T)* ¥ 0. Given an integer t such 
that 4A(t)? + 27B(t)? + 0, let €; denote the elliptic curve over Q with Weierstrass 
equation 


Ep: y7 =x + A(t)x + B(t). 


Given an elliptic curve E/Q and a prime p, define ap(£) to be the pth coefficient 
of the L-function attached to E. 
It was conjectured by Nagao [11] that 
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log p 
a 


1 P 
rank E(Q(T)) = — Jim > D7 Dap (E) (1) 


p<X 1S] 


Rosen and Silverman [12] proved that (1) held if one assumed Tate’s conjecture 
and that a certain L-function didn’t vanish on the right edge of its critical strip. In 
particular, (1) holds unconditionally for rational elliptic surfaces. 

In light of (1), we think it’s likely that the average rank of elliptic surfaces over 
Qis 0, essentially because the average value of a, (€;) should be 0. We give a more 
thorough justification for this belief in Sect. 2. 

There are several ways the belief that the average rank of elliptic surfaces is 0 can 
be formulated as a precise conjecture. For convenience, we introduce the following 
framework for discussing possible formulations of such a conjecture. 

Let S(M) with M € Z.o be a sequence of subsets of the set of all elliptic 
surfaces over Q, with the properties that S(M) is finite for every M, and S(M) C 
S(M") whenever M < M’. We'll say that S = US7_,S(M) is a family of elliptic 
surfaces, and the filtration given by M is an ordering of that family. This parallels 
the language used when discussing statistics of ranks of elliptic curves, where, for 
example, people might discuss “the family of quadratic twists of a fixed elliptic 
curve, ordered by conductor”. 

Define the average rank of the family S to be the quantity 


1 
jim, asap 2 rank €(Q(T)). 


EES(M) 


We believe that the average rank of many families S will be 0. In formulating 
Conjecture 1.1 below, we pick a specific family for the sake of concreteness, but 
there are many other reasonable choices. 

Conversely, there are several examples in the literature of families of elliptic 
surfaces over Q which were constructed to have strictly positive rank [5, 6]. Average 
ranks of the elliptic curves coming from specializations of elliptic surfaces have 
been studied as well [1, 6, 14], and in some situations the average ranks of the 
specializations can exhibit rich and possibly unexpected behaviour. 

To formulate Conjecture 1.1 below, we first define, for positive integers d and M, 
the set 


Pa(M)={p € Z[T] : deg(p) = d, w(p) < M}, 


where j4(p) is the Mahler measure of p (using the height of p would also be 
reasonable here). Set 


Smn(M) = {Er y* =x3+A(T)x+B(T) : A € Pyy(M?), BE Py(M?), 4A(T)? +27B(T)? ¥ 0}. 


Then we believe the average rank of the family S = S,,, will be 0 whenever m 
and n are both positive, i.e. 
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Conjecture 1.1 For any fixed positive integers m and n, we have 


li >) tank E(Q(T)) = 0 
M=00 4S, n(M) a = 
, EESmn (M) 


In Sect.3 we discuss the main obstacles for proving Conjecture 1.1 using the 
analytic framework in [12]. In Sect.4 we outline an approach to Conjecture 1.1 
based on investigating statistics of ranks of elliptic surfaces over finite fields. 


2 Probabilistic Heuristics 


For a fixed prime p, Birch [2] gave all moments of the distribution of ap(£) when E 
is chosen by selecting a (possibly singular) Weierstrass equation with coefficients in 
F,, uniformly at random. Let (A p,,) be a sequence of independent random variables 
indexed by a prime p and a positive integer t, with the property that, for every f, 
the random variable A,; has the same distribution as the values ap(E) for E a 
Weierstrass equation in F, chosen uniformly at random, as in Birch’s work. We 
highlight the following property of the sequence (A,,;): 


Proposition 2.1 For any ¢ > 0, the series 


converges to 0 with probability 1 as X goes to infinity. 


Proof By construction we have 


1 1 I 2 
var( any oe “EP ) ge. 
X2zt7 Pp Are Dp 


Thus 


1 a log p il 
Var| —— )) >>) Aps— | K sae DL log p)? «1. 
X2"2 ex t=1 P x x 
p<Xt p< 


By Kolmogorov’s three-series theorem, as stated in [4, theorem 2.5.8] for example, 
it follows that the series 


1 # 1 
py 


X2 p<X t=1 P 
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converges almost surely. Hence the series 


converges to 0 almost surely. oO 


Proposition 2.1 is relevant because it suggests that, if we believe that the values 
ap(E;) are distributed in the same way a,(£) is for elliptic curves E/F, chosen 
uniformly at random, then we should expect that 100% of the time the quantity in 
(1) is 0. 

There is evidence to support the belief that the values ap(€,) will be distributed 
in this way. If, instead of fixing p and letting ¢ vary initially, we fix t and let p vary, 
then, for all « > O the series 


Ap, —— (2) 


converges to 0 as X goes to infinity with probability 1. This suggests that, for a fixed 
elliptic curve E,/Q, we should expect that 


| 
Yaw “<s (3) 
p 


p<X 


for all e > 0, because we should expect that the reductions E; /F , will be distributed 
uniformly at random. The bound (3) was proven by Heath-Brown [8], so the 
reductions E,/F, do indeed behave uniformly randomly in this case. 

The popular belief that the average rank of elliptic curves over Q is 5 can also 
support the idea that these sums of a, values behave randomly in the way described. 
The Birch and Swinnerton-Dyer conjecture (BSD) connects the rank of an elliptic 
curve E/Q to the coefficients a,(£) of its L-function. The original formulation of 
BSD was 
Il pt+l~ay(E) Cr (log x) EQ 
p<X P 


for some constant Cg which depends on E. See work of Goldfeld [7], K. Conrad 
[3], and Kuo-R. Murty [9] for some treatment of this specific formulation of BSD. 
One consequence of this form of BSD is [13] 


1 1 | dx 1 
nk E(Q) = —— = (E) log p — ; 4 
rank E(Q) = 5 ax | = Lawl dog p= +0(— 5) (4) 
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It is widely believed that many families of elliptic curves over Q have average rank 
5. This belief suggests that the quantity 


1 
= 2. ap(E) log p (5) 


p<x 


appearing in Eq. (4) should average to 0 in those families. However, because the 
error term in (4) depends on E, even knowing that the average rank of elliptic curves 
was 5 wouldn’t be enough to conclude that the quantity (5) averages to 0. While it 
would be surprising if the error terms in (4) did not average to 0 as well, controlling 
these error terms is difficult, and is the main obstacle in proving anything about 


average ranks from this perspective. 


3 Obstacles for Analytic Proofs 


In this section we discuss what obstacles exist which make proving Conjecture 1.1 
difficult within the framework established in [12]. In proving Nagao’s conjecture, 
Rosen and Silverman first prove an analytic version of his conjecture: 


P 
] 
Res ) >) ap(Ex) =" = rank E(Q(7)). ©) 


P t=l P 


To do this, they introduce the following notation: 


« L2(E/Q, 5) is the Hasse-Weil L-function of €/Q attached to H? (E/Q, Q). 

* NS(E/Q) is the Néron-Severi group of €/Q. 

* .is the trivial part of NS(E/Q) ® Q, generated by the image of the zero section 
and by all components of all fibers. 

* .A,(1) is the Tate twist .%@ Tz(Q*) of the Gal(Q/Q)-module .Y 

« L(S%(1), 5) is the Artin L-function attached to the representation .% (1). 

L2(E/Q, 5) 

MOLD" TAD) 

Then, assuming the Tate conjecture, Silverman and Rosen prove that both sides of 

(6) are equal to 


d 
Res — log N(E/Q, s). 
s=2 ds 


The conjecture (1) then follows from standard analytic techniques applied to the 
series 4 log N(E/Q, s). 

We now discuss what might be involved in a proof of Conjecture 1.1 using these 
analytic techniques. Suppose that there exist constants d¢ > 0 and C¢ such that 
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p 
= Lavy EP + rank (Q(T) < Cex-®, 


p<X t=1 P 


as is typical in this kind of setting. To prove Conjecture 1.1, it would be sufficient to 
find, for every elliptic surface € € S, areal number X¢ > 0 depending only on €, 
such that both 


1 1 L log p 
lim ——~ Yo — SO Yoap(&) =0 (7) 
eae #S(M) EcS(M) Xe p<X¢g t=1 P 


and 


1 = 
lim —— ers 
M->co #S(M) d, CexXe* =0 (8) 
EeS(M) 


simultaneously. Condition (7) requires that M be “large” relative to the values X¢ 
so that the averages 


1 


Roma a ap (Er) 
oe EeS(M) 


for fixed p and ¢ are small. Condition (8), on the other hand, would like for M to be 
“small” relative to the values X¢. The difficulty, then, is to find some choice of X¢’s 
such that both of these conditions hold at once. We expect that showing that there is 
such a choice will be difficult. Condition (7) might require a Pélya- Vinogradov style 
result, where one shows that there are no “conspiracies” among the values ap (€;) 
that might cause them to behave differently than random variables. Condition (8) 
might require subconvexity bounds, and the number é¢ will depend on the locations 
of the zeroes of N(E/Q, s). 


4 An Approach via Finite Fields 


Let S¢-m,» denote the set of elliptic surfaces €/F» : y? = x3 + A(T)x + B(T) with 
deg(A) = m and deg(B) = n. This set is finite. Let pg(m, n) denote the proportion 
of elliptic surfaces in Sp.m_» which have positive rank. 


Conjecture 4.1 For every prime £0, and every pair of integers mo, no > 0, 


: : 7 1 
lim pe(mo,no) = lim pg y(m,no) = lim pe)(mo,n) = =. 
l—>00 m—>0oo n—>0o 2? 
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This conjecture, beyond being interesting in its own right, provides an approach for 
proving Conjecture 1.1. See [10] for experimental evidence towards Conjecture 4.1, 
where p¢(m, n) is estimated computationally for 2 = 7,n = 6, 12, 18, 24, 30, and 
m <n/2. 

Let N be a squarefree positive integer. Let S ‘Y)(M) denote the subset of S(M) 
for which gcd(N, 4A(T)? + 27B(T)*) = 1. Then, for any m, n, and M, 


#H{E € SS) (M) : E/F¢ has positive rank for all £|N} 
#S\)(M) 


=|] petn,n) + OM) 


£\N 
(9) 


by the Chinese remainder theorem, where €/IF,¢ denotes the reduction mod ¢ 
of €/Q. 

If €/Q has positive rank, then either the reduction €/F, has positive rank, or 
the kernel of the reduction E(Q(T)) > E(F¢(7)) is of finite index in €(Q(T)). 
If this kernel was never of finite index then Conjecture 1.1 would follow from 
Conjecture 4.1 (as well as much weaker versions of this conjecture), via observation 
(9). The kernel of the reduction €(Q(T)) > E(F¢(T)) is of finite index in €(Q(T)) 
occasionally, but presumably not nearly enough for this approach to fail. However, 
proving as much seems difficult. The generators of €(Q(T)) will map to the identity 
of E(Fe(7)) if their denominators are divisible by 2, so one is naturally lead to 
investigate the dependence of the height of the generators of €(Q(7T)) on the size of 
the coefficients of the Weierstrass model of €/Q. 
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On Rational Bianchi Newforms and M®) 
Abelian Surfaces with Quaternionic cen 
Multiplication 


J. E. Cremona, Lassina Dembélé, Ariel Pacetti, Ciaran Schembri, 
and John Voight 


Abstract We study the rational Bianchi newforms (weight 2, trivial character, 
with rational Hecke eigenvalues) in the LMFDB that are not associated to elliptic 
curves, but instead to abelian surfaces with quaternionic multiplication. Two of these 
examples exhibit a rather special kind of behaviour: we show they arise from twisted 
base change of a classical newform with nebentypus character of order 4 and eight 
inner twists. 


2010 Mathematics Subject Classification Primary: 11G05, Secondary: 11G40 


1 Introduction 


Let f be a classical newform with weight 2 and level N. When f has rational 
coefficients, the Eichler-Shimura construction attaches to f an elliptic curve Ef 
defined over Q, a quotient of the Jacobian of the modular curve X9(N), such that 
L(f,s) = L(E¢, 5). More generally, if f has coefficients in a number field Mf of 
degree d, this construction associates a simple abelian variety A of dimension d 
defined over Q, now a quotient of the Jacobian of the modular curve X,(N), 
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such that End(Ays) @ Q ~ My and L(Ay,s) = TI, L(o(f), 5), the product 
over all embeddings 0: My < C. (It may happen that Ay acquires additional 
endomorphisms over a number field, and in particular A ¢ may not be geometrically 
simple.) 

It is natural to seek such a relationship for modular forms over other number 
fields. In the case of totally real number fields, in many instances (for example, over 
fields of odd degree) a construction analogous to the Eichler—Shimura construction 
is obtained by replacing the modular curve by a suitable Shimura curve. However, 
a general such construction over number fields is not known. So with the simplest 
case in mind, we are led to ask: given a Bianchi newform F of weight 2 over an 
imaginary quadratic field K, is there an abelian variety A defined over K whose 
L-series matches that of F (as above)? 

A precise answer to this question, consistent with the predictions of the Lang- 
lands philosophy, is provided affirmatively by a conjecture of Taylor [Tay95, 
Conjecture 3]—but with a wrinkle. Let Mr be the number field generated by the 
Hecke eigenvalues of F, and let d = [Mr : QJ]. Then conjecturally there is an 
abelian variety Ay of dimension 2d defined over K with quaternionic multiplication 
(QM) by a quaternion algebra Br over Mf (and defined over K) such that 


L(Ar,s) = I] L(o(F),s)°. (1.1) 


o:MroC 


In this formulation, it may happen that the quaternion algebra Br ~ M2(Mf) is 
split, in which case Ar ~ (Ap)? is isogenous over K to the square of an abelian 
variety A’, of dimension d and L(A’,,s) = [[, L(o(F),s). (In general, if the 
base field K has a real place, then Br is necessarily split, which explains the 
simplifications above.) 

In particular, if F has rational coefficients (i1.e., Mr = Q), then attached to F 
is conjecturally a QM abelian surface that is often the square of an elliptic curve. 
Examples of the phenomenon where the quaternion algebra is a division algebra 
were exhibited by Cremona [Cre84, Cre92] in the case where F is the base-change 
of a classical newform with quadratic coefficients, or a quadratic twist of such 
a form. More recently, Schembri [Sch19] exhibited examples which are not base 
change. QM abelian surfaces have been called false generalized elliptic curves by 
Taylor [Tay95], false elliptic curves by Serre (as attributed by Deligne—Rapoport 
[DR73, p. DeRa-12]), and fake elliptic curves by others. Perhaps this terminology 
would be more appropriate for the non-existent geometric object whose L-function 
is equal to L(F,s). 

Given that the situation is rather unusual, in this paper we study it explicitly. 
We consider the Bianchi newforms F of weight 2 with rational coefficients in the 
L-Functions and Modular Forms Database (LMFDB) [LMF20]. For thousands of 
such forms, we can explicitly attach an elliptic curve Er such that L(F,s) = 
L(Er,s), with Er unique up to isogeny over K. For the remaining forms, we can 
find no such elliptic curve—in Sect. 2, we list these forms and summarize what is 
known about them. In all these remaining examples, we can attach a QM abelian 
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surface A over K such that L(A,s) = L(F,s)*, in agreement with the above 
conjecture. 

Two examples in the list stand out, and because of their exotic properties they 
command the majority of our attention here. We consider the rational Bianchi 
newform F of weight 2 with LMFDB label! 2.0.7.1-30625.1-c over K = Q(./—7), 
along with a certain quadratic twist G, which has label 2.0.7.1-30625.1-e. The 
newform F has rational coefficients and is a twist of a base-change form, but there 
is no quadratic twist of F which is base-change; in fact the simplest twist which 
is base-change is by a character of order 8, so has neither trivial nebentypus nor 
rational coefficients. 

Our main result (combining Proposition 6.2, Theorem 6.9, and Proposition 7.5) 
is the following. 


Theorem A Let F be the rational Bianchi newform over K := Q(/—7) of weight 
2, level (175), and trivial character with LMFDB label 2.0.7.1-30625.1-c. Then the 
following statements hold. 


(a) There exists a classical modular form f of weight 2, level 1225, character of 
conductor 35 and order 4, and Hecke field Q(é24), such that 


F= fx @y, 


where fx is the base change of f to K and yf is a character of conductor 
(5./—7) and order 8. 

(b) There exists an explicit genus 2 curve Xr over K whose Jacobian Jr := 
Jac(X ¢) has QM by a quaternion algebra of discriminant 15 and 


Lyp(Xr,T) = Lp(Jr,T) = Lp (F, TY 


for all primes p 4 (2), (5), (./—7). Moreover, all endomorphisms of Jr are 
defined over K. 

(c) Let A be the abelian eightfold over Q attached to f. Then End(A ¢) = Z[24]. 
Let A’ := (Ap)? be the base change of Af to K twisted by f (relative to 
an automorphism of Ay of order 8). Then there is an isogeny A’ ~ i x ie 
over K, where G = F ® w+ is a quadratic twist of F (with LMFDB label 
2.0.7.1-30625.1-e), and Jc the corresponding quadratic twist of Jr. 


The form is described in detail in Sect.4, and the twists are given explicitly in 
Sect. 6. An equation for the curve X - in Theorem 1(b) is given in (7.4). 


‘Here, 2.0.7.1 is the LMFDB label for the base field K = Q(./—7) and 30625.1 the label for 
the level ideal (175), which has norm 30625. The final c is the alphabetic label for this specific 
newform at that level. We use either full labels such as 2.0.7.1-30625.1-c for Bianchi newforms, or 
the shorter version 30625.1-c which omits the field when that is clear from the context. 
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Contents The paper is organized as follows. In Sect.2 we survey Bianchi new- 
forms in the LMFDB. After quickly setting up notation for characters in Sect. 3, we 
describe the classical modular form f in Sect. 4; then we consider its base change 
and twists in Sects. 5-6. We then give a model for the curve X- in Sect.7 and 
in Sect.8 briefly describe related Bianchi modular forms in the same level. We 
conclude in Sect. 9 with some connections to the Paramodular Conjecture. 


2 Rational Bianchi Newforms with No Associated Elliptic 
Curve 


As of August 2020, the LMFDB [LMF20] currently contains rational Bianchi 
newforms over each of the five fields K = Q(V/d) ford = —3, —4, —7, —8, —11, 
as computed by Cremona [Cre84], of all levels of norm up to a bound depending 
on the field, currently (August 2020): 150,000 for d = —3, 100,000 for d = —4, 
and 50,000 for d = —7, —8,—11 (see Table 2.1). The total number of rational 
newforms within these ranges is 185,581, of which 5917 arise from base change 
from Q. All but 40 of the 185,581 newforms F have a candidate associated elliptic 
curve F' defined over the field K in the sense given in the introduction, namely 
L(E,s) = L(F,s); or equivalently that they have the same compatible system 
of £-adic Galois representations. It is possible (in principle) to prove modularity 
of all these elliptic curves, for example using the Faltings—Serre—Livné method 
(as detailed by Dieulefait-Guerberoff—Pacetti [DGP10]). This is being carried out 
systematically by Mattia Sanna (Warwick) for his PhD thesis [San20], using a 
combination of 2-adic and 3-adic methods combined with recent modularity lifting 
theorems. 

Of the 40 remaining rational Bianchi newforms without associated elliptic 
curves, six arise from base change of classical modular forms over Q with trivial 
character, real quadratic coefficients and inner twists: see [Cre92]. A further 18 
are twisted base change, and for 16 of these 18, the base-change form itself and 
the form over Q have trivial character; the other two will be studied in this paper. 
The remaining 16 forms are associated to abelian surfaces over the base field with 
quaternionic multiplication (QM), and were considered by Schembri [Sch19]. 

There is a natural action of Gal(K |Q) = (t) on rational Bianchi newforms, 
f  t(f), where ap(t(f)) = arp)(f); if g = t(f) then we say that f, g 
are Galois conjugate. (Our forms have rational coefficients, so there should be no 
confusion with other Galois actions.) 

Organizing according to field and level, we have the following further details 
about the 40 rational newforms with no associated elliptic curve. These are either 
base-change, or come in Galois conjugate pairs. In some cases the forms mentioned 
are twists of base-change, but the base-change newform itself has too large a level 
to be in the current database, so does not yet have a label. 
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Table 2.1 Summary of Bianchi modular form data in the LMFDB 
Field discriminant 4 8 3 vi 11 Total 
Level norm bound 100000 | 50000 | 150000 | 50000 | 50000 
Rational newform counts 
All rational newforms 40030 | 36843 | 42343 | 35824 | 30541 | 185581 
Not twist of base change 38844 | 35536 | 40986 | 34830 | 29468 | 179664 
Base change of rational newform 908 809 955 518 504 3694 
Base change of quadratic newform 16 10 28 24 13 91 
Twist of base change of rational 240 350 328 378 492 1788 
newform 
Twist of base change of quadratic 2” 138 46 70 64 342 
newform 
Twist of base change of 
higher-dimensional newform g 0 ” 2 ? 3 
Rational newforms with no elliptic 
curve 
Total 4 8 21 7 0 40 
Quadratic twist of base change 0 8 9 5 0 22 
Higher order twist of base change 0 0 0 2 0 2 
Not twist of base change 4 0 12 0 0 16 


¢ K = Q(V-—1), LMFDB label 2.0.4.1: four cases, none of which are base-change 
(even up to twist). 


°K 


34225.3-a is not a base-change or twist of base-change. It is associated to a 
simple abelian surface with QM over K, the Jacobian of the genus 2 curve C 
found by Schembri [Sch19, Theorem 1.1]. 

34225.3-b is a quadratic twist of 34225.3-a. 

34225.7-a and 34225.7-b are Galois conjugates of the previous two. 


= Q(/-—2), LMFDB label 2.0.8.1: eight cases, all base-change (up to 


twist). 


5625.1-b and its Galois conjugate 5625.3-b are both twists of base-changes of 
classical newforms in $2(28800). 

6561.5-a and its Galois conjugate 6561.5-d are both twists of a Bianchi 
newform at level (2592), which is a base-change from $2(20736). The 
classical newform has coefficients in Q(v'15) and attached to the Bianchi 
newform is an abelian surface with QM of discriminant 10, but an explicit 
equation is not known. 

21609.3-b and 21609.3-c are both twists of a Bianchi newform at level 
(14112), which is a base-change from $2(11289). The classical newform 
has coefficients in Q./7 ) and attached to the Bianchi newform is an abelian 
surface with QM of discriminant 14. Again, an explicit equation is not known. 
21609.1-b and 21609.1-c are Galois conjugates of the previous two. 
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¢ K = Q(/-—3), LMFDB label 2.0.3.1: 21 cases, of which 12 are not twists of 
base-change but have associated genus 2 curves with QM: 


61009. 1-a is associated to the genus 2 curve C2 found by Schembri [Sch19, 
Theorem 1.1], while 61009.1-b is a quadratic twist of this, and 61009.9-a and 
61009.9-b are their Galois conjugates. 

67081.3-a is associated to the genus 2 curve C3 found by Schembri [Sch19, 
Theorem 1.1], while 67081.3-b is a quadratic twist of this, and 67081.7-a and 
67081.7-b are their Galois conjugates. 

123201.1-b is associated to the genus 2 curve C4 found by Schembri [Sch19, 
Theorem 1.1], while 123201.1-c is a quadratic twist of this, and 123201.3-b 
and 123201.3-c are their Galois conjugates. 


The other nine cases are twists of base-changes of forms with trivial character, 
coefficients in a real quadratic field and an inner twist; in these cases, explicit 
equations are not known: 


5625.1-a is the base-change of 225.2.a.f, which has CM by —15, coefficients 
in Q(/5), and an inner twist. This is an example of the phenomenon first 
described by Cremona [Cre92]. 

6561.1-b is base-change of 243.2.a.d, with coefficients in Q(v6) and inner 
twist, attached to an abelian surface with QM of discriminant 6. 

30625.1-a and its Galois conjugate 30625.3-a are twists of base-change of 
forms in $2(11025) with CM by —15. 

50625.1-c and its Galois conjugate 50625.1-d are base-changes of 675.2.a.1 
and 675.2.a.m respectively, with coefficients in Q(/2) and inner twist, 
attached to abelian surfaces with QM of discriminant 6. 

65536.1-b and its Galois conjugate 65536.l-e are twists of a form at 
level (768), which is base-change of 2304.2.a.p, with CM by —6, coefficients 
in Q(/2) and inner twist. 

104976. 1-a is base-change of 972.2.a.e, with coefficients in Q(V2) and inner 
twist, attached to an abelian surface with QM of discriminant 6. 


«© K = Q(V-7), LMFDB label 2.0.7.1: seven cases. Of these, two are the forms F 
and G in Theorem | and discussed in detail in Sect. 8 below. In addition, we 
have 


30625.1-d: see Sect. 8. 

40000. 1-b and its Galois conjugate 40000.7-b are twists of 30625.1-d. 
10000.1-b and its Galois conjugate 10000.5-b are twists of forms which are 
the base-change of forms in $2(19600) with coefficients in Q(V/5) and inner 
twist. 


© K = Q(V-11), LMFDB label 2.0.11.1: none. 
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3 Characters 


We begin by setting some notation, in particular we define some characters. Let Q”! 
be an algebraic closure of Q. By a character over Q we will mean either a Dirichlet 
character or the character of the absolute Galois group Galg := Gal(Q* |Q) 
attached to it via class field theory; we define the same over a number field K 
allowing Hecke characters (of finite order). If x is a character over Q, we may 
talk of its restriction x|, to K thinking of it as a character of Galg and restricted 
to Galx := Gal(Q® | K). 

Let N := 1225 = 5272; we now define some Dirichlet characters of modulus 
N. Denote by x5 the mod 5 cyclotomic character of order 4 (with LMFDB 
label x1225(393, -)) and by x_7 the quadratic character of conductor 7 (label 
X1225(1126, -)). As characters of Galg, these cut out the cyclic extensions Q(é5) 
and 


K := QW-7) (3.1) 


respectively. Define ¢ := x5x_7 (label 1225 (293, -)); then ¢ has conductor 35 and 
order 4. Let 


H := (ys) x (x~7) © Z/4Z x Z/2Z (3.2) 


be the group generated by x_7 and xs. 

After restricting to K, the character x_7|x is trivial, so €|x = x5|x. Also, while 
Xs is not a square (since Q has no cyclic extension of degree 8 ramified only at 5), 
its restriction to K is a square, since 5 is inert in K. We denote by y one of the two 
order eight characters of K such that ~~? = x5| K = &|x; the other such character 
is >. The conductor of w is (5./—7). 

The reason why the character y must ramify at the prime ./—7 can be explained 
as follows. The multiplicative group of the residue field of the prime ideal (5) is 
F3, and so admits an order 8 character ws. Such a character is odd, 1.e., Ws5(—1) = 
—1.A finite order Hecke character has infinity type (0, 0) (i.e. it is trivial at the 
Archimedean places), hence if it is unramified outside 5, such a character would 
not be trivial at —1. However, the quadratic character w JI is also odd, and the 


character y corresponds to the Hecke character ramified at {5,./—7} with such 
ramification at both primes. 


4 A Classical Newform of Level 1225 and an Abelian 
Eightfold 


In this section, we investigate a certain special classical newform; all statements 
below can be confirmed using MAGMA [BCP97] or PARI/GP [PAR21]. 
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We retain notation from the previous section. Consider the newspace 
So(N, €)"°™. Its relative dimension as a vector space over Q(e) ~ Q(i) is 56, and it 
decomposes under the action of the Hecke algebra into 7 irreducible components, 
of relative dimensions 2, 2, 4, 8, 12, 12, 16. Their absolute dimensions, as Q-vector 
spaces, are obtained by multiplying by [Q(e) : Q] = 2. 

Let f be any one of the four Galois conjugate newforms in the unique 4- 
dimensional component; these four form a single orbit under Gal(Q* | Q(e)). Of 
the eight Gal(Q* | Q)-conjugates of f, four are in S2(N, €) and the other four in 
So(N,€), where € = ea X5X-7. The form f does not have CM and its Hecke 
field is Mf := Q(£24). 


Remark 4.1 The form f has Nk? = 4900 > 4000, and so is just outside the current 
range of forms in the LMFDB. 


Proposition 4.2. For each character x € H, the twist f ® x is a conjugate of f, 
i.e., there exists o € Gal(M ¢ | Q) such that f ® x =a(f). 


Proof First observe that the level N is divisible by the square of the conductor of ¢, 
and also that neither y_7 nor x5 is a square. This implies that the twists have the 
same level as N [Shi94, Proposition 3.64]. 

Let x € H. Then the nebentypus character of f @ x is ¢x*, which equals either 
€ or €. Hence the twists are newforms in either $2(N, ¢)"*™ or So(N, €)"*”. The 
Galg-orbit of f ® x has size at most 8 (but might a priori be smaller). However, 
ar(f) = -¢, is an eighth root of unity, while the a2 coefficients of all the newforms 
of smaller dimension in S2(N, ¢) (which lie in Q(¢12)) are not roots of unity, so 
cannot be equal to x (2)ap»(f). Hence f @ x has the same Hecke field as f and by 
the uniqueness of the Q-dimension 8 component of S2(N, ¢)"*” must be conjugates 


of f. 


Proposition 4.2 says that the newform f has eight inner twists (o, x) (including 
the trivial twist, where both x and o are trivial). To keep track of these inner twists, 
we use the following notation: let og € Gal(Q(¢24) | Q) be the automorphism such 
that 4 (f24) = ¢5,. Checking the first few Fourier coefficients ap(f) we see that the 
inner twists are defined by the data (o, x) in Table 4.1. 

By the Eichler-Shimura construction, attached to (the Galois orbit of) f is a 
simple abelian 8-fold A = A+, defined over Q with End(A)g := End(A) ® Q ~ 
M;. Write Ax := A xq K for the base extension of A to K = Q(/—7), and 
At := A xQ Q for the base extension of A to Q*!. To determine how A splits 
over Q* we must determine the geometric endomorphism algebra End(Aga)a, 
which is generated over Q(¢24) by endomorphisms arising from inner twists of f, 


Table 4.1 Inner twists of f 


ph 3 2, 3 
x i Xs (Xs 1x5 X=1 X5X_7 X5 X71 X5 X71 
o oO! 019 013 O7 O17 O11 05 023 
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by Ribet—Pyle [Rib80, Rib04, Pyl04] theory. We use the implementation in MAGMA 
[BCP97] due to Quer [Que09] and Stein. 

Proposition 4.4 The following statements hold. 


(a) Each inner twist (0, x) of f gives rise to an endomorphism &(g, x) of A®', defined 
over the field Q(x) cut out by x and satisfying 


aE (c,x) = §(c,x) 0 (@) (4.5) 


for all a € Q(é24). The endomorphisms &(,,) are a Q(624)-basis for End(A®)g. 
(b) We have End(Ax)q ~ M2(Q(ég)), with centre Z(End(A x )) = Zé]. 


, 3,5\., nies : 
(c) We have End(A®)g ~ M4(B) where B ~ ( O ) is the division quaternion 


algebra of discriminant 15 over Q. All endomorphisms of A®! are defined 
over K' := Q(és5, /—7). 

(d) There exists a simple abelian surface S defined over K with End(S)g ~ B and 
an isogeny A ~ S* defined over K'. 


Proof Statement (a) is an application of work of Ribet [Rib80, (5.5)] together 
with the computation of inner twists above (Proposition 4.2). Statement (b) follows 
from (a): over K = Q(./—7), the abelian variety Ax obtains the endomorphism 


§(o17,x_7)3 Since the fixed field of o17 is Q(¢g) and Q(é24) = Qlég)(/—3), we 
conclude that 


—3,-7 
End(A aes ~M : 4.6 
nd(Ax)Q ( Os) ) 2(Qés)) (4.6) 
We have ¢g3€ End(Ax) because already Z[f24] C End(A), and Q(fg) C 


Z(End(Ax)qg). 

We finish (c) and (d) following Pyle [Pyl04]. Over Q*!, we have A ~ S” where 
S (over Q*) is simple, called a building block. We have dim S = 1 or 2, and 
these cases are distinguished by a Brauer class; MAGMA confirms that dim S = 2, 
r = 4, and therefore End(S)g ~ B. Moreover, calculating the obstruction to 
descent [Pyl04, Proposition 5.2], we find that S and its endomorphism algebra may 
be defined, up to isogeny, over K. 


The inner twist relation f ® x = o(f) implies, on comparing the nebentypus 
character on both sides, that 


ex? = a;(€) (4.7) 
and that for all but finitely many p, we have 


ap(f)xX(P) = op (f))- (4.8) 
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Together these imply that ap(f)?/e(p) is fixed by all o € Gal(Q(é24)/Q), so we 
conclude that ap(f ae é(p) € Q for all but finitely many p (and this relation holds 
in generality). 

The L-function of A (over Q) is 


L(A,s) = |] LUf @x,), (4.9) 


xeH 


since the 8 Galois conjugates of f are precisely the 8 twists f ® x for x € 
H. After base change to K’ = Q(¢5, /—7) to trivialize all the characters, by 
Proposition 4.4(d) we find that 


L(A, K',s) = L(S, K',s)* = L(fx’, s)® (4.10) 
so that 


L(S, K',s) = L(fx’, s)’. (4.11) 


5 Base Change to Q(v —7) 


In light of the results of the previous section, we now move to K = Q(./—7). By 
Proposition 4.4(b), we have 


Ax ~ CG, G1) 


where Co is a simple abelian fourfold defined over K with End(Co)g ~ Q(ég). 

We now consider the base-change fx of the newform f to K, a Bianchi modular 
form of weight 2. In the classical language of automorphic forms, the base change 
is known as the Doi—Naganuma lift [DN70] and can be obtained using the theta 
function method as generalized to arbitrary weight, level, and character by Friedberg 
[Fri83, Theorems 3.1, 3.2]. More generally, base change can be understood in 
the language of automorphic representations: see Gerardin—Labesse [GL79] or 
Langlands [Lan80]. 

Although we do not use it explicitly, we recall the formula for the Hecke 
eigenvalues of fx. If p splits in K and p lies over p then dp(fx) = ap(f); if 
p is inert in K then a(p)(fx) = fig fy — 2e(p)p. 

Recall the character y of order 8 defined in Sect. 3. 


Proposition 5.2. The base-change fx has the following properties. 


(a) We have fx € So(0,W~*), so fr has level N = (175) = (5)*(—7)* and 
nebentypus character €\x = w~?. 
(b) The Fourier coefficients of fx lie in Q(és). 
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(c) For each inner twist (0, x) of f, the base change fx has inner twist 
by (0, x|x); so fx has four distinct inner twists (including the trivial inner 
twist). 


Proof The level of fx is determined uniquely by the properties that it is stable 
under Galois conjugation and of norm N*/ Disc(K)* = 12257/77 = 30625 [Tur18, 
Lemma 5.2], where we use the fact that the —7-twist of f has the same level by 
Proposition 4.2: f has inner twist by (017, x-7). Alternatively, since 1225 = 37 
the level of fx divides (5)2(./—7)” but this is already equal to the conductor of the 
character ¢|x = w~? of fx. (This also follows from the recognition of fx @ y € 
S2(To(9t)) in Proposition 6.2 below.) 

Let (o, x) be any of the inner twists of f, soo(f) = f ® x. Restricting to K 
gives o(fx) = fx © x|x. Taking the inner twist to be (017, x_-7) shows that 
o17(fx) = fx, So the coefficients of fx lie in the fixed field of 017, which is Q(ég). 


After restriction to K, the characters in H coincide in pairs to give all the even 
powers of wy. Hence we have 


4 
L(A, K,s) =] ]Lfe @ v7.5)’, (5.3) 


i=l 


and hence from (5.1) that 


4 
L(Co,s) =] [Lr @¥",5). (5.4) 


i=l 


6 Twisting 


We now investigate twists of the base change form fx € S2(%, yr?) from the 
previous section. Define 


F:=fx®wv and G:=fr@w=Foy'. (6.1) 


Proposition 6.2. The twists F,G € S2(9t) have rational eigenvalues; the form F 
has LMFDB label 2.0.7.1-30625.1-c, while G has label 2.0.7.1-30625.1-e. 


Proof Note that twisting leaves the level unchanged and the twisted forms have 
trivial nebentypus. 

It is enough to show that F is fixed by all o € Gal(Q(ég) | Q), since twisting by 
the quadratic character y+ preserves rationality. Let (o, x) be an inner twist of f. 
Then o( fx) = fx ®@ x|x and 


o(fx @v) =o(fx) @o(W) = fx ® xlko(w). (6.3) 
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Since there is an inner twist for every 0 € Gal(Q(ég) | Q), the result is equivalent to 
establishing the character relations 


w=xlkoW) (6.4) 


for all such x. Inspection of Table 4.1 shows that for a € (Z/8Z)*, 


(Vig = (WO YP = WI = Hoa) (6.5) 


verifying (6.4). 

Having identified a base change, we now examine the space of Bianchi newforms 
of level 9t, weight 2 and trivial nebentypus character, a space with LMFDB label 
2.0.7.1-30625.1. The new subspace has dimension 60 and contains 8 newforms with 
rational Fourier coefficients. Of these one matches F’,, namely 2.0.7.1-30625.1-c, 
while G matches 2.0.7.1-30625.1-e, and is the V/5-twist of F (since y+ = x3|,, 


the quadratic character associated to K (/5) > K). 


Remark 6.6 Itis straightforward to identify whether a Bianchi newform F is a twist 
of base change since if F and t(F) are twist-equivalent then either F' is a twist of 
base change or is induced from a Hecke character [LR98]. Furthermore, for two 
Bianchi newforms F) and F) with trivial nebentypus, if ap (F 1)? = Ap (F2)? for all 
but finitely many primes p then F; and F> are twist-equivalent [Ram00]. 


Remark 6.7 In fact, all eight rational Bianchi newforms in the space S$2(9t) (with 
LMFDB label 2.0.7.1-30625.1) give interesting examples: we briefly describe the 
other six below (Sect. 8). The LMFDB does not contain details of Bianchi newforms 
with nontrivial nebentypus character, such as the form fx itself. 


That the twisted forms F and G have rational eigenvalues means that one can 
reasonably expect the corresponding twist of the fourfold Co to split as a product 
of QM-surfaces. Recall (see after (5.1)) that End(Co)g ~ Q(ég); we choose an 
automorphism € € Aut(Co) of order 8. With this choice, we may twist Co via w 
(relative to €), to obtain a new fourfold defined over K, denoted C: 


c= c™. (6.8) 


Theorem 6.9 There is an isogeny C ~ Sr Xx Sg defined over K, where Sr, Sg are 
abelian surfaces defined over K with QM by the rational quaternion algebra B of 
discriminant 15, satisfying L(Sr,s) = L(F, s)” and L(Sg, 8) = L(G, s)*. 


Proof Let € = | (mod 8) be prime and let VzCo be the ¢-adic Tate representation 
of Co, a free Qe[€]-module of rank 2 where & 4 1]; choosing a primitive eighth 
root of unity ¢g € Qe, we may write 


Velox QD Wer (6.10) 
ke(Z/8Z)* 
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where each W, x is a Qe-vector space of dimension 2 with € acting by the scalar on 
Now V,Co is also a representation of Galx; comparing with the L-series 
decomposition (5.4), as Galois representations we have (up to the choice of ¢) 


Wer = Wei @vrt. (6.11) 
On the twist, we have the Galois representation 


VC = VeCo Que} QelEl(Y) (6.12) 


where Qe[é](v) is Qe[E] where Galx acts via y: explicitly, on the eigenspace of 
Qe[é] where & acts by ae now Galx acts via we, In terms of the splitting (6.10) 
above, we have 


nc= QD wWeeevi=Dwiow (6.13) 


ke(Z/8Z)* k 
and thus 


L(C, K,s)=L(fx @¥,S)Lfx @W°, )LUfx @ Ww’, s)L(fx @ wv, 8) 


= L(F,s)°L(G,s)?. 

(6.14) 

Let L D> K’ = K (6s) be the cyclic degree 8 extension of K cut out by w. By 
work of Kida [Kid95, p. 53] we have 


7 : 
ResrjxCo,, ~K I] co . (6.15) 
i=0 


i.e., the restriction of scalars of the base change of A to L is isogenous over K to the 
product of the twists of A by powers of yw. From the L-series decomposition (5.4) 
and the theorem of Faltings, we conclude that 


d : 
Lice? secp xc (6.16) 
i=0 


Let S be the building block from Proposition 4.4(d); up to isogeny (over K’), 
we may choose S so that End(S) = O, where O C B is a maximal order. Then 
AL& Cie ey oF. so Co.L ~L Se Since Q(V2) splits B and all maximal orders in 


O are conjugate, there exists a € O such that a? = 2; then € | € M>(O) is 
a 


an element of order 8, so we may consider twists of S* by powers of yw. Applying 


356 J. E. Cremona et al. 


the theorem of Kida now to S$”, with (6.15)—-(6.16) we conclude there are isogenies 
over K 


7 7 
[ [082% ~ Resiix SZ ~ Resijx Con ~ T] Oy"? ~ cp x ct. (6.17) 
i=0 i=0 


Since Co is simple, we conclude that S is an isogeny factor of C over K, and C ~ 
S x §’ for another abelian surface S$’. We have ($2) — (sv? since sv) is 
the quadratic twist by w* (corresponding to the quadratic extension K(/5) > K), 
so S) is also an isogeny factor of C over K. Since S is a QM abelian surface, its 
L-series is a square, so by (6.14) we have S # S’, andsoC ~ S x sv), and thus 
up to labelling L(S,s) = L(F, s)” and the result follows. 


Remark 6.18 The argument in Theorem 6.9 can evidently be generalized to more 
general situations. 


7 An Explicit Model for the Building Block 


In light of Theorem 6.9, we now seek explicit models for the abelian surfaces Sr 
and Sg whose L-functions match F and G. Because G is a quadratic twist of 
F, it suffices to exhibit a model for Sr. We refer to Buzzard [Buz97] and Voight 
[Voil9, Chapter 43] as general references on abelian surfaces with quaternionic 
multiplication. 


’ 


Let B := 


) be the quaternion algebra over Q of discriminant 15, and let 


O C B be a maximal order. Then there exists 1p € O such that 2 + 15 = 0, 
a principal polarization on O. The moduli space for principally polarized abelian 
surfaces with QM structure by (O, jz) is a Shimura curve X0(15, 1) defined over Q 
by work of Shimura [Shi67] and Deligne [Del71]. 

An explicit model for X9(15, 1) was computed by Jordan [Jor81, Proposi- 
tion 3.2.1] (see also Jordan—Livné [JL85, Table 1]) and Elkies [Elk98, (76)]): it 
is described by the equation 


Xo(15, 1): (x? + 3)(x? + 243) + 3y* = 0. (7.1) 


In particular, X9(15, 1) is a genus 1 curve with X9(15, 1)(Q) = Xo(15, 1)(R) = @. 
The Jacobian of X (15, 1) is the elliptic curve over Q with LMFDB label 15.a5. 
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The quotient by the Atkin—Lehner involution w 5 gives the following map. 


Xo(15, 1) > X0(15, 1)/wis x P! 


, (7.2) 
J, y) =x°/81 
The Igusa—Clebsch invariants of weights 2,4,6,10 for the ‘universal’ abelian 
surface over X0(15, 1)/wis are given by (/2 : [4 : Ig : 19) where 


Ip = 12(j* + 157? + 105j* + 125j + 10), 


(7.3) 
Ig = 4547 + DG +3 G -)*, 


Io = 9(84j° + 14147 + 8865)? + 111577 + 38957 + 185)(j +3)? — D4, 
Io = 2(7 +3)°G — 1)”. 


The invariants (7.3) were computed by Elkies (2007) using K3 surfaces and by 
Guo-Yang [GY 17, Table 4] using Borcherds products. 

With this data in hand, we look for points on X9(15, 1) over K. The base change 
of its Jacobian to K has rank 1, so we will find infinitely many points. Searching in 
a small box, we find the points (x, y) = (£9/./—7, £180/7) for which j(x, y) = 
—1/7 and normalized invariants (648 : 23625 : 17474625 : —392000000). From 
these invariants, we construct a genus 2 curve X’ over K = Q(/—7) well-defined 
up to twist—we confirm that the Euler factors match up to sign in the sense that 
Lp(X', T) = Lp (f, + T)* for good primes p, and this gives strong evidence that we 
have found a match. 

Next, we identify the quadratic field Ks > K which is associated to the twist. For 
a good prime p, write Ly(X', T)=(1—bpT + Nm(p)T?) so that by = (p)ap(F) 
and (p) = +1. Then (p) = —1 if and only if p is inert in Ks; searching for fields 
with this property supported at primes dividing the discriminant of X’ gives a twist 
X for which the Euler factors match for all p with Nm(p) < 200. 

In this way, we obtain a candidate model X for a curve whose Jacobian 
putatively corresponds to F’; however, this model has enormous coefficients. So 
we seek to reduce these coefficients. We first apply the reduction algorithms of 
Bouyer—Streng [BS15] to reduce the discriminant. We would like to apply the 
Cremona-Stoll algorithm [SC03] to reduce the coefficients of X further, but the 
current implementation requires a real place in the base field. Nevertheless, we can 
apply some reductions by hand, and then iterate. In this way, we find the following 
model for X = Xp: y* = fr(x) where 
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fr(x) := (—1277479451 10654443 10316373455693222445855/—7 

— 76950699888 165822373247468369 13 139396789)x° 
+ (369033561 11088112749998450776069205963943./—7 

+ 74766360407432704741728853365227303755813)x° 
+ (—32191715349438139105751177726148896397205./ —7 

— 160594143536856131940416520163791514260770)x* 
+ (3138757306334269455206726101294212247520/—7 

+ 143070480545726568206381517716426697366880)x° 
+ (872371717006802010044 1878642537682595835./—7 

— 59213195766173130554503672005222304866040).x* 


+ (—4080652149555552661387182770620477619703V/ —7 
+ 104577735708741195 12625 1307618 13845233833)x 


+ 5278401062954824 17952525742290110846145./ —7 


— 49371315805 144553493594855 1740658676743. 
(7.4) 
By construction, we know that Jp := Jac(X) has End(J7) > O. Looking at Euler 
factors we see that J does not have CM, so End(Jr) = O, since O is a maximal 
order. 


Proposition 7.5 For all good primes p we have Ly(Xr,T) = Ly(Jr,T) = 
Ly(F.TY, 


Proof The proof of this proposition is a standard application of the method of 
Faltings—Serre, as explained by Dieulefait-Guerberoff—Pacetti [DGP10]; we only 
sketch some pertinent details here. Attached to F = fx ® yp is a Galois repre- 
sentation pr.2: Galxk — GL2 (Q). The characteristic polynomial of Frobenius at 
primes above 2 and 11 are given by 1 + T + 27? and 1 — 47 4+ 11T*, whose 
roots are different and do not add zero, hence the coefficient field of pr.2 may 
be taken as Q» by Taylor [Tay94, Corollary 1]. Since Jr has QM (defined over 
K) by B of discriminant 15 and 2 { 15, the 2-adic Tate representation V2J- is 
free of rank 1 over B ® Qo ~ Mo(Q>) and so affords a Galois representation 


PJp2: Galx > GL2(Q2). 
The 2-division field K (J7[2]) is defined by the polynomial 


y!?_6y!! 421 y!9_50y?+69y8—42y7—69y°+210y°—190y*+48y>+168y2— 160 +64 
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(as an absolute extension); we confirm this extension is an $3-extension of K, so the 
mod 2 representation 9,9: Galx — GL2(F2) ~ 53 is surjective. 

Looking at Euler factors, the mod 2 representation p..: Galx — GLo(F2) ~ 
S3 attached to F is also surjective. Since the conjugate of F under Gal(K | Q) is 
a twist of F’, the extension K(JF[2]) is Galois over Q with Galois group S3 x 
C2 ~ De, and so arises as the Galois closure of a sextic extension of Q unramified 
away from 2, 5, 7. From the Jones—Roberts database [JR14], there are exactly 5 such 
extensions whose Galois closure contains Q(./—7): 


x® — 3x9 4+ 2x4 — 6x3 + 25x? — 19x + 8, 
ag? a3? ae Dy 2. 

x? 9 4 On = 11g" 4 ox + ie 
x 94? 4 Tin 8x? + 30x 4 24 ts, 
x© — x9 4 x4 — 20x3 + 8x? — 64x + 512. 


Looking at Euler factors, we rule out all but the first one, and then confirm that 
its Galois closure is isomorphic to the one for Jy; this shows that the two mod 2 
representations (7 and p;, 9 are equivalent. 

Since both residual representations are isomorphic with absolutely irreducible 
image, following Dieulefait-Guerberoff—Pacetti [DGP10, Section 2.1] we then 
show that fully the 2-adic representations pr,2 and p,,.2 are equivalent by showing 
there is no obstruction to lifting. Computing a set of obstructing primes (whose 
Frobenius classes generate the Galois group of the maximal exponent 2 extension 
of K (Jr[2]) unramified away from 2,5, 7), we conclude it is enough to check that 
tr pr.2(Frobp) = tr ey,,2(Froby) for a primes p of K dividing 3, 11, 13, 17, 23, 29. 
Having already checked this for all good primes p up to norm Nm(p) < 200, we 
conclude the proof. 


8 Other Rational Bianchi Newforms at Level (175) 


As well as the newforms F and G with labels 2.0.7.1-30625.1-c and 
2.0.7.1-30625.1-e, there are six other rational newforms of the same level, which 
display a variety of phenomena. All the quadratic twists mentioned here are by //5 
unless otherwise specified. 


¢ Newform 30625.1-a is the base-change of classical modular forms in $2(1225) 
with LMFDB labels 1225.2.a.b and 1225.2.a.d. These are —7-twists of each 
other, and associated to the elliptic curves in the isogeny classes 1225.b and 
1225.d. The base-change to K of the curves in both these isogeny classes all 
lie in the isogeny class 2.0.7.1.30625.1-a. 
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¢ Newform 30625.1-b is a base-change, but of a newform with coefficients in 
Q(/2). The associated modular abelian variety is a surface which splits over K 
into the product of elliptic curves, with associated elliptic curves in the isogeny 
class 2.0.7.1.30625.1-b, which are Q-curves but not base-change. 

¢ Newform 30625.1-d is also base-change, but of a newform with coefficients in 
Q(/5) whose associated modular abelian variety is a surface which does not 
split over K. Hence there is no elliptic curve associated to the newform, while 
there is an abelian surface with QM. The newform has CM by —35 (so is its own 
quadratic twist) and is an example of the situation described in [Cre92, p. 411]. 

¢ Newform 30625. 1-f is the quadratic twist of 30625.1-a so has associated elliptic 
curves 2.0.7.1.30625.1-f which are base-change. 

¢ Newform 30625.1-g is the quadratic twist of 30625.1-b so has associated elliptic 
curves 2.0.7.1.30625.1-g which are Q-curves but not base-change. 

* Newform 30625.1-h is the quadratic twist of 1225.1-a at level St = (35). It 
has associated elliptic curves in isogeny class 2.0.7.1.30625.1-h which are base- 
changes of those in isogeny classes 1225.h and 1225.j, and is itself the base- 
change of classical newforms 1225.2.a.h and 1225.2.a.j. 


9 Relation with the Paramodular Conjecture 


We conclude with an application to the Paramodular Conjecture. Recall the 
following conjecture due to Brumer—Kramer [BK 14] and its corrigendum [BK19], 
as amended by Calegari. 


Conjecture 9.1 There is a bijection between the set of isogeny classes of QM 
abelian fourfolds B over Q of conductor N? and the set of cuspidal, nonlift Siegel 
paramodular newforms f of genus 2, weight 2, and level N with rational Hecke 
eigenvalues, up to nonzero scaling. Moreover, if B <> f in this bijection, we have 
the equality 


L(B,s) = L(f,s, spin)’. (9.2) 


As with Bianchi modular forms, the frequently arising case is where End(B)g ~ 
M>(Q), in which case B ~ A? for A an abelian surface over Q with End(A) = Z 
with L(A, s) = L(f, s, spin). (When End(B)q is a division algebra, the nonexistent 
geometric object A’ such that L(A’, s) = Lf, s, spin) could be thought of as a fake 
abelian surface.) 

To our Bianchi newform F, by theta lift one can attach a Siegel paramodular 
form II, with rational eigenvalues [BDPS$15, Theorem 4.1] such that L(F,s) = 
L(II, s, spin). The form IT satisfies the hypothesis of the Paramodular Conjecture 
(Conjecture 9.1), so there should be a QM abelian fourfold attached to IT. Indeed, 
by Theorem 6.9 the Weil restriction of scalars Resx|gSr of Sr from K to Q gives 
the desired geometric object, and 
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L(Resx|QSr,5) = L(Sr, K,s) = L(F,s)? = L(I, s, spin)’. (9.3) 


Remark 9.4 In work of Boxer—Calegari-Gee-Pilloni [BCGP18, Lemma 10.3.2], 
assuming standard conjectures a general argument is given to explain the existence 
of a geometric object attached to an automorphic form on GSp, with rational 
Hecke eigenvalues. In concrete examples, the veracity of the standard conjectures 
is very hard to check; however, when both the automorphic form and (fake) abelian 
surface can be explicitly given, the method of Faltings—Serre can often be applied 
successfully in practice [BPPTVY] to establish an equality of L-functions, such as 
(9.3). 
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Abstract We describe the computation of tables of Hilbert modular forms of 
parallel weight 2 over totally real fields. 


Tables of classical modular forms have been computed for many decades now, 
starting with a table of rational eigenforms (corresponding to modular elliptic 
curves) appearing in the proceedings of the Antwerp conference in 1972 [2]. 
Large tables of classical modular forms also now exist, with the most recent large- 
scale effort available the L-functions and Modular Forms Database (LMFDB) [13] 
described by a team of ten authors [1] along with the fascinating history. 

In this article, we continue this tradition of tabulating automorphic forms on 
Shimura varieties by computing extensive tables of Hilbert modular cusp forms of 
parallel weight 2 over totally real fields. This work is in a sense complementary to 
work [3] on tables of (modular) elliptic curves over Q(v5). Our data is available 
online at the LMFDB [13]. 

In Sect. 1, we give an overview of the basic algorithms employed. In Sect. 2, we 
describe in more detail the implementation in the computer algebra system MAGMA 
[4], providing some documentation and detail about computations in practice and 
a few auxiliary algorithms employed. In Sect.3, we gather the data which was 
computed and discuss the extent of the computation. Finally, in Sect.4 we discuss 
some interesting aspects of the data in detail. 
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1 The Algorithm: An Overview 


In this section, we give a basic overview of the algorithm we employ. For a motivated 
introduction, see the expository work by Dembélé—Voight [9]; for a more basic 
reference on Hilbert modular forms, see e.g. Freitag [10]. 

Throughout, let F' be a totally real field of degree n = [F : Q] with ring of 
integers Zr and let Xt C Zr be an ideal. Let S;,(9t) denote the space of Hilbert cusp 
forms of level 9t and parallel weight k € Z>2 for F, and let S/°“ (90) be the new 
subspace. 

The algorithm we use combines methods of Dembélé [7] and Dembélé—Donnelly 
[8], along with methods of Greenberg—Voight [11] and Voight [16]. 


Theorem 1.1 There exists an (explicit) algorithm that, on input a totally real field 
F and an ideal X%t © Zr, computes as output the systems of eigenvalues for the 
Hecke operators Ty with p { Yt and the Atkin-Lehner involutions Wye with p° || Tt 
that occur in the space S$“ (N). 


In other words, there exists an explicit finite procedure which takes as input the 
totally real field F, an ideal St C Zr (encoded in bits in the usual way [6]), and 
an even integer k > 2, and outputs a finite set of number fields Ef C Q and 
sequences (a (p))p» encoding the Hecke eigenvalues for each eigenform constituent 
f in S,(9D and af(p) € Ef. 

In this article, we consider the first and essential case k = 2; we believe it would 
be an interesting project to extend these efforts to include tabulation of Hilbert 
modular forms of higher weight as well. 

The general strategy of the algorithm in Theorem 1.1 is as follows. Let B be the 
quaternion algebra over F' which is unramified at all finite places and ramified at 
either all or all but one real place according as n = [F : Q] is even or odd. Then by 
the Eichler-Shimizu—Jacquet—Langlands correspondence (for a precise statement, 
references, and discussion see Dembélé—Voight [9, Theorem 3.9]), there exists an 
isomorphism of Hecke modules 


52M S S2(9D 


where Ss? (Jt) denotes the space of quaternionic modular forms over B of weight k 
and level 9t. The description of the space as (St) varies accordingly as n is even or 
odd. Let O C B be a maximal order. 

If n is even, we use Brandt matrices, following the method of Dembélé [7] 
and Dembélé—Donnelly [8]. The Hecke module he (St) is computed via an explicit 
action on maps from the set of right ideal classes in O to P!(Z/9). We refer to 
this as the definite method, since the algebra B in this case is definite. 

If instead n is odd, the algorithm in Theorem 1.1 is supplied by Greenberg— 
Voight [11] and Voight [16]. For simplicity of exposition, assume F has strict class 
number 1; the method works for arbitrary class number. Let OF = {y € O: 
nrd(y) = 1} denote the group of units of O with reduced norm 1. Suppose that the 
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unique real split place of B corresponds to the embedding t44 : B — Mo(R) and 
let 11) = too (OF) /{+1} C PSL2(R). Let Oo(9t) be an Eichler order of level Jt 
and let Pg (90) = too (O9 (ID i )/{+1}. Then by the theorem of Eichler-Shimura, and 
Shapiro’s lemma, we have a further isomorphism of Hecke modules 


~ en al"(1 + 

SPM) > H'(P(), Coind.">, C) 
where + denotes the +1-eigenspace for complex conjugation. The Hecke module 
52 (St) is then computed via this isomorphism as group cohomology, using an 
explicit presentation for the group (1) based on the computation of a fundamental 
domain [15]. We call this the indefinite method. 


Remark 1.2 More generally, one can use a quaternion algebra B over F of 
discriminant , ramified at all or all but one real place, and St an ideal coprime 
to D; then with 3t = DI, the Jacquet-Langlands correspondence yields an 
isomorphism 


SF (OM) > SIH? 7™ 


where $2(9t)2"°" C $y(Nt) denotes the space of Hilbert cusp forms which are new 
at all primes dividing ©. Since D is squarefree, these methods overlap when there 
is a prime p which exactly divides the level St. We use this to verify our calculations 
by computing with both the definite and indefinite method when it applies; for more, 
see Sect. 3. 


2 Computing Systems of Hecke Eigenvalues 


In this section, we discuss the MAGMA [4] implementation of the above algorithms. 
These methods saw a first implementation over Q by David Kohel, and they rely 
heavily on code for quaternion algebras over number fields, due to Donnelly, Markus 
Kirschmer, Nicole Sutherland, and Voight (documented in work of Kirschmer— 
Voight [12]). 

Our algorithm has four main steps: 


. Precomputation 

. Computing a basis 

. Hecke computation via reduction 
. Decomposition into newforms 


BRWN re 


We refer to Dembélé—Voight [9] for an explicit mathematical description; here, 
we provide some additional comments about the implementation. 


Step 1: Precomputation The first step of the algorithm involves precomputation 
steps that only depend on the field F’: that is to say, for all levels St with p || 9%, this 
step need only be performed once. For the definite method, the precomputation step 
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involves determining a set of representatives for the right ideal classes in O up to 
isomorphism. 


e We precompute the theta series of each left order © of each right ideal 
representative as 


#{a € OL) : Trr/gnrd(a) = n} 


for a small set of values n € Zs. This reduces the number of isomorphism tests 
of ideals when computing Hecke operators below, at the marginal additional cost 
of requiring the computation of left orders and representation numbers. 

e As computing inverses and colon ideals can also be time consuming, we compute 
for each representative right ideal J a small element b € J and a left ideal J’ 
of prime power norm such that O,(J)b = II’: then JI’ = J(I~')b, and a 
generates JJ’ if and only if a/b generates JI~!. 


For the indefinite method, we compute a fundamental domain for the Shimura 
curve associated to the maximal order, together with a presentation for the group 
T9(1) [15]. 


Steps 2 and 3: Computing a Basis and Hecke Computation To compute a basis 
of forms and to compute Hecke operators, we need fast algorithms for working 
with P!(Z;-/90, and our efficient implementation treats the case where Jt is prime 
and composite differently: in the latter we index elements [a : b] € P!(Zr/9t) 
according to divisor a = (a) | St. With these representatives in hand, in the definite 
case we list orbits of P!(Z,-/9t) under the action of the stabilizer groups and in the 
indefinite case we record the permutation action in the coinduced module. (If we 
were working in higher weight, and a nontrivial representation V, we would put a 
copy of V on each element of P!(Z/90).) 

For the sake of reliability, we also verify the dimension of the space using the 
dimension formula (which involves class numbers and embedding numbers). 


Step 4: Decomposition into Newforms With Hecke matrices in hand, we now 
outline the procedures used in our implementation to decompose a newspace as a 
direct sum of new simultaneous eigenspaces. Let S = S2(St). We use the following 
properties. 


1. The Hecke operators T, for p coprime to Jt all commute. 

2. The Hecke eigenspaces in S have distinct systems of eigenvalues for the 
operators T, with p coprime to Jt (“multiplicity one”). 

3. We have dim SP-°'4 = 2dim$2(9t/p) — dim $2(Nt/p*), with the last term 
omitted if p” does not divide 9. 


These properties are standard: for (2), see for instance work of Shemanske— 
Walling [14, Theorem 3.6], and (3) follows from standard facts about degeneracy 
maps: the proof is a straightforward generalization of the classical proof where 


F=Q. 
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Using (1) and standard linear algebra techniques, we decompose the space 
into Hecke constituents corresponding to Q-irreducible subspaces (a direct sum 
of eigenspaces). As harmless as it may seem, this step is often the bottleneck in 
computation. 

Finally, we show how to decompose a space into old and new subspaces. For a 
prime p { t, the p-old subspace S?-°!4 of § = $(9V) is the span of the images 
of all degeneracy maps S2(Jt/p) —> S2(9t). The old subspace $°" is the sum of 
the spaces S?-°'4 for primes p | Jt. The newspaces S"” and SP°™ are the unique 
Hecke-invariant complements of S°4 and S?-°!4, respectively. 

Now suppose we are given two Hecke-modules S and So (e.g. So = S2(9t/p)) 
and we wish to compute the Hecke-submodule S°¢ of § consisting of all the 
irreducible Hecke-submodules of S that arise from So via degeneracy maps. In 
addition, suppose we know a priori the dimension of S°4, as in (3). For a prime 
p { MN, the space $4 is contained in the kernel of the characteristic polynomial 
of Tp | So evaluated at T, | S. Since we know the dimension, we may compute 
S°'4 in this manner using finitely many primes p, stopping when the intersection 
has the right dimension. One could obtain the complement S"*” by computing the 
Hecke-invariant complement, requiring extra module-splitting effort. However, for 
present purposes, we do not need to know S"™ as a particular subspace of S; we 
merely need the Hecke action on S"*”, which can be obtained using the quotient 
sso ~ gnew 

Alternatively, we can compute the degeneracy operators themselves, and then 
compute the orthogonal complement. 

Now suppose that S is a new subspace of dimension d. By (1), it contains d 
simultaneous eigenvectors having distinct systems of eigenvalues, which we wish 
to determine. The Hecke algebra T = Z[Ty]n is a commutative Z-algebra which 
is generated by Hecke operators T, with p coprime to Jt. Therefore T @ Q is a 
direct product of number fields, an étale Q-algebra generated by a single primitive 
element. For practical purposes, one may find a primitive element by trial and error 
(for instance taking simple combinations of the first few Hecke operators), since the 
set of non-primitive elements must be sparse. Then the simultaneous eigenspaces of 
S are simply the eigenvectors associated to the primitive element. 


3 Data Computed 


We compute for the totally real fields F of degree n < 6 with discriminant < 2-10”. 
Note that if [F : Q] > 6 then dr > 2. 10” by the Odlyzko bounds; indeed, the 
totally real field of degree 7 of smallest discriminant has discriminant 20134393. 

In this range, the discriminant and degree of the field uniquely characterizes the 
field, with one exceptional pair: in degree 4, there are two fields of discriminant 
16,448: one, with minimal polynomial x* — 2x? — 6x? + 2 has Galois group $4 
and the other x+ — 2x3 — 7x* + 8x + 14 has Galois group Dg. (The first multiple 
discriminant for cubic fields is 3969.) 
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This gives a total of 233676 Hilbert modular forms in the database. To save 
space, we did not include data for conjugate levels; for consistency across the 
database, these were later added to the LMFDB, increasing the overall size of the 
data. 

The computation was performed using the Vermont Advanced Computing 
cluster (VACC). The odd degree fields, which used the indefinite method, took 
approximately 4 CPU years to compute. Much of this time is spent computing 
Atkin-Lehner involutions and coping with higher class number issues. 

The data is hosted at the LMFDB [13] and backed up in a local repository. The 
raw data takes 172 GB, compressed 78 GB. 


Remark 3.1 As in Remark 1.2, in many situations we can employ either the definite 
or indefinite method. For example, if F has odd degree and p || 9t, then one can 
apply the definite method with a quaternion algebra B ramified at all real places and 
the prime p; the Hecke module computed is then S2(9t)? "*’. To compute a single 
space of forms, the definite method runs more quickly than the indefinite method, 
but for tabulation one must use the indefinite method if one wants to compute with 
forms of square level; it is also more efficient to use the indefinite method since for 
each quaternion algebra used one must repeat the precomputation step. 


4 Comments on Data 


Base Change Forms We detect base change forms as follows. First, a necessary 
condition for f to be a base change from a smaller field Fo is that o(t) = Tt 
and dg(p) = 4p for allo € Gal(F'/Fo). If the form fails this test (for all possible 
subfields Fo), it is not a base change form; if it passes this test, then is likely to be a 
base change. (One could detect more generally forms that are twists of base change 
forms.) To be certain and identify such lifts, we could look in the database for forms 
over the field Fo with level supported at primes dividing the relative discriminant 
OFF or Ntp = ZFo fannie 


CM Forms We detect CM forms as follows. Let p be a prime which is trivial in 
the narrow class group, and let E = Q({ap}p)) be the number field generated by 
the Hecke eigenvalues. Then E is totally real, and the characteristic polynomial 
of Frobenius T* — apT + N(p) has discriminant dy = a; — 4N(p) € E. If the 
form is CM and ay ¢ p, then the Frobenius belongs to the quadratic field of this 
discriminant. So if we find two such primes p whose CM discriminants belong to 
different square classes in E* /E**, then the form does not have CM; if these square 
classes agree, then it gives strong likelihood that the form is CM with the given CM 
discriminant K = E (/dp). One can then also verify that ay = O for primes p inert 
in K to provide further evidence. To conclude that the form really is CM, we could 
use the machinery for Hecke characters. 
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Computing the L-function Given the Hecke eigenvalues ay (f) of a Hilbert modular 
form f with trivial central character, we define its L-series via the Euler product 


L(s, f) = | [CU — ap(Nm p)*)! | [Cd = ap (Nm pp) + (Nm py)! 
plm pry 


The function L(s, f) has analytic continuation to all of C and a functional equation 
A(s, f) = (dz Nm))*"Pe(s + 1/2)"L(s, f) = AU —s, f). 


Factorization of New Subspaces The largest Hecke-irreducible space in our data- 
base has dimension 286, and occurs for the field F = Q(./296) and level Jt of norm 
29. It would be interesting to consider the distribution of the dimensions of Hecke 
irreducible subspaces inside the new subspace. 


Endomorphism Algebras We can also see some interesting patterns in the dis- 
tribution of endomorphism algebras of modular abelian surfaces over totally real 
fields. First, we have the following conjecture of Coleman: see e.g. Bruin—Flynn— 
Gonzalez—Rotger [5, Conjecture C(e, g)]. 


Conjecture 4.1 Let F be a totally real field and let g € Z>,. Then the set 
{E = End(X)g : [E : Q] = g and X an abelian variety over F with dim(X) = g} 


is finite. 


In particular, there should be only finitely many real quadratic fields E that 
occur as the endomorphism algebra of a GL2-type abelian surface (g = 2) over 
a fixed totally real field F. (Our data becomes a bit too sparse to say anything 
for g > 3 except for a few fields.) There are a total of 37444 Hilbert modular 
forms whose Hecke eigenvalue field has degree g = 2; corresponding to such a 
form f over F is an isogeny class of abelian surfaces X over F whose common 
endomorphism algebra E = End(X)g is a quadratic field. We consider a refined 
version of Conjecture 4.1 where we consider also quadratic orders, and tabulate 
these. 

Hilbert modular forms over F whose Hecke field E' is an imaginary quadratic 
field correspond to Hecke Grossencharacters. The largest absolute discriminant of 
an imaginary quadratic order observed was —288, occurring for a Hilbert modular 
form over F = Q(/440), a field with class number 2. 

More interesting are the cases when the Hecke field F is a real quadratic field. 
The largest such discriminant of a real quadratic order observed as a Hecke algebra 
was 1260, occurring over the totally real quartic field F of discriminant 11025. The 
distribution of the quadratic Hecke fields for a fixed field F shows rapid decay by 
discriminant; the following histogram of the quadratic Hecke fields for F = Q(/5) 
is typical. 
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The largest discriminant for F = Q(/5) is d = 228, and it occurs just once with 
level norm 3420 (up to the computed level norm 4999). We tabulate in a similar way 
the following maximum discriminant quadratic Hecke fields for the first few F: 


dr |max(d) 
‘5 [228 
8 | 384 
“12 | 192 
“13 [252 
17/192 
21 | 588 
24 | 288 
28 | 448 


Put together, this data gives some weak evidence for Conjecture 4.1—at the very 
least, it shows that the large discriminant fields occur with very low density. 
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Isogeny Classes of Abelian Varieties over M®) 
Finite Fields in the LMFDB spooks 


Taylor Dupuy, Kiran Kedlaya, David Roe, and Christelle Vincent 


Abstract This document is intended to summarize the theory and methods behind 
£q_isog collection inside the ab var database in the LMFDB as well as some 
observations gleaned from these databases. This collection consists of tables of 
Weil g-polynomials, which by the Honda-Tate theorem are in bijection with isogeny 
classes of abelian varieties over finite fields. 


1 Introduction 


The LMFDB (L-Functions and Modular Forms Database) includes a database of 
isogeny classes of abelian varieties defined over finite fields. This database can be 
accessed at https://www.lmfdb.org/Variety/Abelian/Fq/. The purposes of this paper 
are on the one hand to document the theoretical results on which these methods 
depend (Sect. 2) and some of the algorithms used to compile the data (Sect. 3); and 
on the other hand to extract some observations from the compiled data, including 
comparison of some statistical data with relevant heuristics (Sect. 4) and collecting 
some examples pertaining to theorems and conjectures in the literature (Sect. 5). 

For small dimension g and prime power g = p’, the database contains searchable 
lists of complete sets of isogeny classes of abelian varieties of dimension g over Fy. 
Table | shows the range of g and g covered by the database as well as the total 
number of isogeny classes of a given dimension. 

We note that by “Bound on qg” we mean that the set of isogeny classes of abelian 
varieties over F, has been computed for every prime power less than or equal to q. 
In dimensions g = | and 2, isogeny classes of abelian varieties defined over Fy 
for powers of 2, 3,5, and 7 up to 1024 have also been included. The isogeny class 
count is the total count of isogeny classes in the database, including for g a power 
of 2, 3,5, and 7 up to 1024 in dimensions | and 2. 
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Table 1 The number of 


: Dimension | Bound ong | Isogeny class count 
isogeny classes for each - 


dimension u 499 6184 
2 211 1,253,897 
3 25 1,055,307 
4 5 183,607 
5 3 281,790 
6 2 164,937 


For each isogeny class, we report a variety of invariants including the follow- 
ing: 


¢ L-polynomial; 

e Newton polygon (and hence p-rank and ordinarity); 

¢ endomorphism algebra; 

¢ Frobenius angles and angle rank; 

¢ whether the isogeny class contains a principally polarized abelian variety or even 
a Jacobian (when known); 

¢ whether the isogeny class is simple; 

¢ when it is simple, the number field defined by the L-polynomial, and the Galois 
group of its splitting field; 

¢ when it is not simple, the simple isogeny factors appearing in its decomposition; 

¢ whether the isogeny class is geometrically simple; 

¢ the point counts over small extensions of the base field; 

e if the isogeny class contains a Jacobian, the point counts for its curve over 
small extensions of the base field (if the isogeny class is not known to contain a 
Jacobian, the same computation is performed and referred to as the point counts 
of the “virtual curve” associated to this isogeny class); 

¢ whether the isogeny class is a base change of an isogeny class defined over a 
smaller field (i.e., whether the isogeny class is primitive), and if it is not primitive, 
the primitive isogeny classes for which it is a base change; 

¢ the twists of the isogeny class: the isogeny classes to which it becomes isogenous 
after a base change. 


Some of the questions we kept in mind in our analysis of the data are: 


¢ How does the number of objects in the database vary with g and q? 

¢ How are these objects distributed if we sort with respect to the Galois group, 
Newton polygon, or angle rank? 

¢ How is the number of points on the abelian variety distributed? Does this change 
if we fix other invariants? 

e What are the extreme values of the number of points? 


There are, of course, many more questions one can ask, but some would 
require further data compilation; this is especially true for questions regarding the 
distinction between Jacobians of curves and more general abelian varieties. We 
discuss possible future directions of inquiry at the very end of the paper (Sect. 6). 
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We conclude this introduction by directing the reader to some of the highights of 
the paper. 


¢ A tabulation of Galois groups of Weil polynomials (Sect.4.2). We attempt to 
explain the results using Malle’s heuristics on the distribution of Galois groups 
of number fields, but the constants appearing in this method do not fit well with 
the data. 

¢ Verification of some cases of a conjecture by Pries on the existence of abelian 
varieties over F,, with prescribed p-ranks (Problem 4.5). 

¢ A list of possible combinations of p-rank, angle rank and Galois groups for a 
fixed p-rank (Sect. 4.4). This is closely related to the Tate conjecture for abelian 
varieties, as in the work of Lenstra and Zarhin [LZ93]. 

¢ Some analysis of the endomorphisms algebras for abelian varieties over F, for 
small g and gq, in line with a question of Oort (Sect. 4.5). 

e An analysis of point counts on abelian varieties on average over isogeny classes, 
rather than over isomorphism classes (Sect. 4.6). The data suggests a fit not to the 
Sato-Tate distribution, but an alternate distribution which we have not found in 
the literature (called herein the isogeny Sato-Tate distribution). 

e A characterization of maximal and minimal abelian varieties, including an 
explanation of how they are unrelated to maximality and minimality of curves, 
as well as some open questions regarding simple abelian varieties (Sect. 4.7). 

e An example of a supersingular curve of genus 5 in characteristic 3, whose 
existence we were unable to infer from any general constructions (Example 5.11). 

e Some counterexamples against a conjecture by Ahmadi-Sparlinski concerning 
the angle rank of ordinary Jacobians (Example 5.15). 


2 Background 


In this section, we recall various standard facts about abelian varieties and their zeta 
functions. 


2.1 Weil Numbers, Characteristic Polynomials of Frobenius, 
and Zeta Functions 


Here we follow [FO08, pg. 9] (up to a sign convention). Let Q*'® be an algebraic 
closure of Q, Z"!® be the ring of algebraic integers in Q*'8, and w € Z. A Weil 
q-number of weight w is an element a € Q*!2 satisfying: 


1. there exists i € N such that gia € Z!8; and 
2. for any embedding y: Q*'2 > C, |W(a)| = qv”. 
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Hereafter we only consider Weil g-numbers which are effective, meaning that 
a € Z8 (and hence w > 0). 

Such numbers arise from the zeta functions of varieties over finite fields as 
follows: Let X be a variety over Fy, a finite field of cardinality g and characteristic 
p. Then we define 


T 


n 
n 


Z(X/F,, T) := exp So #X (Fyn) 


n>1 


For any Weil cohomology theory X % H™(X) (e.g., €-adic étale cohomology for 
£< p, or Berthelot’s p-adic rigid cohomology), we have 


2 dim(X) ; 
Z(X/Fy,T)= [] Luo, — Ly (1) = det(1 — FT|H™(X)), 


w=0 


where F: X — X denotes the action of Frobenius. 

If X is smooth and proper of dimension n, then the eigenvalues of F on H”(X) 
are Weil g-numbers of weight w; for étale cohomology this is Deligne’s theorem 
[Del74], while for rigid cohomology it follows from Deligne’s theorem by a result 
of Katz—Messing [KM74]. Also, if we define ¢(X,s5) := Z(X/F,,q~°), then we 
have a functional equation 


C(X,n—s) =+(q"/2q7*)* C(X, 5) 


which follows from Poincaré duality (see [FK88, §II.1] for the étale case and 
[Ked06a] for the p-adic case). In the displayed formula above, xX;op is the topolog- 
ical Euler characteristic; if X is the reduction of a variety over a number field then 
Xtop is the alternating sum of the Betti numbers of the cohomology of the complex 
manifold obtained by taking the C-points of the variety in characteristic zero. 

In the special case where X = A is an abelian variety of dimension g, 
dim H'(A) = 2g and H”(A) = AYH'(A) for w = 0,...,2g. We summarize 
this second relation by saying that L,(T) = A”L,(T). We refer to L1(T) as the 
L-polynomial of A and to its reverse det(T — F|H '(X)) as the characteristic poly- 
nomial of A, or the Weil polynomial of A. We use these two terms interchangeably 
throughout the text. 


2.2 Weil Polynomials 


Define a Weil q-polynomial (or Weil polynomial when q is understood) to be a 
monic polynomial over Z whose roots are all Weil g-numbers of weight 1. (Since 
we are exclusively interested in abelian varieties, we consider only Weil g-numbers 
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of weight 1 in what follows.) A standard first step in classifying Weil polynomials 
is the following observation left as an exercise to the reader. 


Proposition 2.1 Let f € Q(T] be an irreducible polynomial with all real roots. Let 
B € R satisfy f (B) = 0. Choose q = p" such that for all Ww: Q(B) > R we have 
w(B)* — 4q < 0. If is a zero of T* — BT +.q = 0, then x is a Weil q-number of 
weight 1. 


As a converse, we have the following characterization of Weil g-numbers. 


Lemma 2.2 (Weil g-number Characterization) Let a be a Weil q-number of 
weight 1. 


1. If Q(z) has a real embedding, thenn = +,/q. 
2. Suppose that w: Q(z) — C is anon-real embedding, then 


(a) B:= 1+ 4q/m is totally real; 

(b) Q(z) is a CM-field with maximal totally real subfield Q(B), and x has no 
real embeddings; 

(c) a isasolution ofl? — pT Le where w(B? —4q) < Oforally: Q(t) > C. 


Proof 


1. In this case, W()* = W(a) W(t) = G. 

2. (a) Since z has absolute value ./q in every embedding, g/m is its complex 
conjugate. 

The fact that Q(z) is CM is [Hon68, Proposition 4], and the second part 
follows. 

Consider the equation 


(b 


wm 


(c 


wm 


T? — BT +q=0. 


Let y = 2 — q/z and note that 6? — 4q = y. The quadratic formula gives 
5(B + y) as the solutions, which simplify to z and g/z. The second half of 
the statement follows from the fact that 2 has no real embeddings. 


Oo 


2.3. Weil Polynomials and Isogeny Classes of Abelian Varieties 


For most classes of varieties over a fixed finite field, among the possible zeta 
functions consistent with the Weil conjectures, it is difficult to predict in advance 
exactly which ones occur. However, for abelian varieties, this question is completely 
solved by the Honda-Tate theorem; we follow the treatment of this result given in 
[WM71]. 
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Let A and B be abelian varieties over F, with characteristic polynomials 
P4(T) := det(T — F|H'(A)), Pa(T) := det(T — F|H'(B)); 


these are Weil g-polynomials. The Honda-Tate theorem makes the following 
assertions: 


¢ Pa divides Pg if and only if B is isogenous (over F,) to a product in which A 
occurs as a factor; in particular, A and B are isogenous if and only if P4 = Pg. 

* If A is simple, then P, = h{ for some irreducible h4(T) € Z[T] and some 
positive integer e = e, which can be read off explicitly from h, (see below). 

e Every irreducible Weil g-polynomial occurs as h4 for some A (which is unique 
up to isogeny). 


We explain the rule for computing e,4 for A simple in the context of analyzing 
the Q-endomorphism algebra E := End°(A/F,). Let z be the class of T in 
Q(z) := Q(T )/(A,4(7)), and identify az with the action of Frobenius in FE. Then E 
is a division algebra with center Q(z) and 


2g = [Q(z) : Qlea, ea = VIE: Q(m)]. 


The Brauer invariant inv, (£) of E at a place v of Q(z) is given as follows: 


¢ For v finite and not lying above p, inv, (E) = 0. 
¢ For v finite and lying above p, 


inv, (E) = —(log, |7|y)[Ky : Qp] (mod Z). 


¢ For v archimedean, inv,(E) = 5 if v is real (which by Lemma 2.2 means 
mw = +,/q) and 0 otherwise. 


The exponent e,4 can now be computed as 
ea = Icdy{inv,(£)}, 


where lcd denotes the least common denominator. In particular, e4, = 1 whenever 
A is ordinary or “almost ordinary” (see Sect. 2.4). 
2.4 Newton Polygons, p-rank and Ordinarity 


Let A be an abelian variety over F, and define Pa as in Sect. 2.3. The normalized 
Newton polygon of Pa is the lower convex hull of the set 


_ordy(ai)\ 
{(7 ee) 11=0,....24] 
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where P4(T) = T78 +a,T78—!+4.- - +49. (This is invariant under base extension.) 
The normalized Newton polygon of Py, is the graph of a piecewise linear function 
on [0, 2g] with changes of slope only at integer values. In particular, on each of 
the intervals [0, 1], ..., [2g — 1, 2g] the function has a unique slope; we call these 
the slopes of Pa(T). For any place v of K = Q(z) above p, the slopes of P4(T) 
coincide with the ratios v(@)/uv(q) as a varies over the roots of P,. 

The normalized Newton polygon of A satisfies the following properties: 


¢ The left endpoint is (0, 0) and the right endpoint is (2g, g). 

e The vertices are all lattice points with nonnegative second coordinate. 

e The vertices are symmetric: (i, j) is a vertex if and only if (2g —i, g — j) isa 
vertex. Equivalently, (i, 7) lies above the polygon if and only if (2g — i, g — /) 
does so. 


We say that any convex polygon satisfying these conditions is eligible in dimension 
g. For N, N’ two eligible polygons, we write N < N’ if N’ lies on or above N, 
and N < N’if N < N’ and N # N’. The eligible Newton polygons in dimension 
g form a partially ordered set with unique minimal and maximal elements. The 
minimal polygon is the one with vertices (0, 0), (g, 0), (2g, g), in which the slopes 
are 0, 1 (each with multiplicity g); when this occurs we say that A is ordinary. 
If we exclude the ordinary Newton polygon, then there is again a unique minimal 
polygon, with vertices (0, 0), (g — 1,0), (g + 1, 1), (2g, g); when this occurs we 
say that A is almost ordinary, as in [LZ93]. The maximal polygon is the one with 
vertices (0, 0), (2g, g), in which the slopes are 5 (with multiplicity 2g); when this 
occurs we say that A is supersingular. 

Define the elevation of an eligible Newton polygon N in dimension g as the 
number of lattice points (i, j) with 1 <i < g, j => 0 which lie strictly below N. 
(This is not standard terminology.) 


Lemma 2.3 Let N,N’ be eligible Newton polygons in dimension g with N < 
N'. Then there exists an eligible Newton polygon N" in dimension g such that 
N < N” <N' and the elevation of N" is one more than that of N. 


Proof Since N < N’, there must exist a vertex (i, 7) of N lying strictly below 
N’; by symmetry, (2g — i, g — j) also lies strictly below N’. Let N” be the lower 
convex hull of the set of lattice points lying on or above N exclusive of (i, 7) and 
(2g —i, g — j); this choice has the desired form. Oo 


Corollary 2.4 The poset of eligible Newton polygons in dimension g is catenary: 
any two maximal chains between the same endpoint have the same length (namely 
the difference in elevation). 


For any positive integer d, by Katz [Kat81] the Newton polygon defines a locally 
closed stratification on the coarse moduli space A, q of g-dimensional abelian 
varieties equipped with a polarization of degree a”. 


Theorem 2.5 Let N be an eligible Newton polygon of dimension g. 
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Lal 


. There is a (nonempty) stratum of Ag,q with Newton polygon N. 

2. Each irreducible component of this stratum has codimension equal to the 
elevation of N. 

3. If N is not the supersingular Newton polygon, then the corresponding stratum is 

geometrically irreducible. 


Proof For the supersingular Newton polygon, this is contained in [LO98, Theo- 
rem 4.9]. For general N, apply Lemma 2.3 repeatedly to construct a maximal chain 
No <--+- < Nj containing NV, where No and Nj are the ordinary and supersingular 
Newton polygons. By the supersingular case, the length of the chain (which is 
the elevation of N;) equals the codimension of each irreducible component of the 
supersingular stratum. 

We now appeal to the de Jong—Oort purity theorem [dJO00, Theorem 4.1], which 
asserts that the Newton polygon stratification on A, q jumps purely in codimension 
1. For each Newton polygon JN in the chosen chain, the union Xy of all strata 
corresponding to Newton polygons on or above WN is a closed subscheme of Ag a; 
the codimension of Xj starts at 0 for N = No, increases by at most | at each step, 
and ends with the elevation of N;. Consequently, it must increase by exactly | at 
each step; from this, the first two claims follow. The third claim is a theorem of 
Chai—Oort [CO11, Theorem A]. oO 


2.5 Galois Groups 


Let a be a Weil g-number. By Lemma 2.2, either 7 = +,/q or Q(z) is CM; 
assume hereafter that the second case occurs. Write Q(z)! for the Galois closure 
of Q(z) and set 2d = [Q(z) : Q] = 7 Let G = Gal(Q(z)#*"/Q), considered as a 
subgroup of Sq by its action on the conjugates of zr. 


Lemma 2.6 Let G be the Galois group of a Weil q-number x. 


1. G is a subgroup of W2q := c > Sq which acts transitively on the 2d roots, 
where here W2g is the wreath product of C2 by Sq. 

2. G contains complex conjugation, which is the unique nontrivial element of the 
center of Wa. 


Proof The fact that G C Wog and contains complex conjugation follows from 
[Dod84, Prop 1.1]. Transitivity follows from the fact that Q(z) is a field, the claim 
that the center of W2q has order 2 follows from its presentation as a semidirect 
product, and that complex conjugation is central is [Hon68, Prop 1 (b)]. oO 


Note that we usually have d = g. In this case, we conjecture that Lemma 2.6 
gives the only constraints on G: 


Conjecture 2.7 Suppose that G is a transitive subgroup of W2, containing complex 
conjugation. Then there is a Weil g-number z such that Gal(Q(zr)&"/Q) = G. 
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Note that the condition on containing complex conjugation is necessary: when 
g = 4 there is a transitive subgroup of Wg (with transitive label 8T14 and abstractly 
isomorphic to S4) that does not contain complex conjugation and thus does not arise 
as the Galois group of a CM-field. 


2.6 Frobenius Angle Rank 


For A an abelian variety of dimension g with L-polynomial L(T) = ee (1—a;T), 
the angle rank of A is the quantity 


6(A) = dima (Spang ({arg(@;) : 1 <i < 2g} U {7})) —1e {0,..., g}. 


Equivalently, 6(A) equals the number of multiplicatively independent elements in 
the set {a],..., Q2¢, qi/ 8 which equals the rank of the Frobenius torus associated 
to A [Chi92]. 

The angle rank detects multiplicative relations among the roots of L; these are 
closely related to exceptional Hodge classes on powers of A.! For example, by a 
theorem of Zarhin [Zar94, Theorem 3.4.3], 6(A) = g if and only if there are no 
exceptional Hodge classes on any power of A. On the other end, 5(A) = 0 if and 
only if A is supersingular; this follows from the fact, proven here in Example 5.1, 
that an abelian variety A is supersingular if and only if 74 = ¢./q for ¢ a root of 
unity. 

Another useful observation is that if A is simple and 0 < 6(A) < g, then 
the inclusion G C Wag from Lemma 2.6 must be strict, because the action of G 
preserves Q-linear relations between z and the arg(q@;). For a more refined version 
of this statement, and more about angle rank, see [DKZB21]. 


2.7 Bounds on Point Counts 


Recall that if A is an abelian variety of dimension g over F, with characteristic 
polynomial (T — a1)---(T' — ag), then 


#A(Fz) = (1 —@)-++ (1 — Qa). 


'A Hodge class is an étale cohomology class that is invariant under the twisted action of Frobenius. 
A Hodge class is exceptional if it is not in the ring (under cup product) generated by Hodge classes 
of weight two. The Tate conjecture states that the space of Hodge classes is spanned by the images 
of algebraic cycles under the cycle class map to /-adic cohomology. This remains an open problem. 
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The Weil bound implies 


(JG —b° <#A(Ry)8 < Og + 
this can be sharpened a bit to give 


[(./q — 171 < #A(F,)'/8 < Ve + 107) 


(see [AHL12, Théoréme 1.1] or [AHL13, Corollary 2.2, Corollary 2.14]). For 
general A, these bounds are best possible: for g = 1, it follows from the Honda- 
Tate theorem (or an earlier theorem of Deuring) that #A(F,) can take the values 
q +1 [2,/q], and the same is then true for arbitrary g by taking powers. 

In light of the fact (which we just used) that 


#(A1 Xp, A2)(Fq) = #A1 (Fy) x #A2(Fy), 


it is natural to separately consider what happens when A is required to be simple. 
In this case, if we exclude A of low dimension, the Weil bounds can be sharpened 
further. 


Theorem 2.8 ([Kad19, Theorem 2.5]) /f A is simple of dimension g > 1, then 


Lo/q - D*) +15 #AG,)'8 < [0/¢+ 071-1. 


Note that this is an improvement on the bound stated earlier when gq is a square. 
For small g, one can make some additional improvements: 


Theorem 2.9 ([Kad19, Theorem 3.2]) For values of q listed below, one has the 
following lower and upper bounds on #A(F,)!/8 for g => 4. (More precisely, the 
bounds hold for all g outside of an explicit finite set of exceptional choices of A, 
each of which has dimension at most 3; see [Kad19, Table 2, Table 3].) 


qd 2 3 4 8 9 
lower bound, 1 1.359 2.2750 4.635 5.47 
upper bound 4.0347 5.6333 7.3818 13.05 14.303 


Looking at an asymptotic version of the question, one can see that there is not 
much room left for potential improvement in these bounds. Building on [AHL13], 
Kadets has shown the following: 


Theorem 2.10 ({[Kad19, Proposition 1.4, Theorem 1.6]) Define 


a(q) = lim inf #A(Fy)'/*, A(q) := limsup #A(F,)'’8, 
A 
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where in both cases A varies over simple abelian varieties of dimension g over F,. 
Then 


\(Vq-1)" 41 < alg) < (Vg - 7142, Let *|-2-97! < A@ < [Cat 71-1. 


The cases where #A(F,) = 1 have been classified by Madan—Pal (modulo 
Remark 2.12). 


Theorem 2.11 ([MP77, Theorem 4]) Suppose that A is simple of dimension g and 
that #A(F,) = 1. 


° We must have q < 4. (This is easy: ifq = 5, then ./q —1 > 1 and the Weil bound 
implies the claim.) 

¢ Ifq =3o0rq =4, then g = 1. (Forq =4, this is the equality case of the Weil 
bound. For q = 3 amore careful argument is needed.) 

¢ Ifq = 2, then the characteristic polynomial of A belongs to an explicit (infinite) 
list enumerated in op. cit. 


Remark 2.12 In [MP77, Theorem 4], the case g = 2 of Theorem 2.11 is asserted 
modulo a conjecture of Robinson [Rob64]: for n a positive integer and ¢, = e27!/", 
eo + 6¢, + 1 is not a square in Q(¢,,) unless n = 7 or n = 30. This was confirmed 
by Robinson [Rob77] using a method of Cassels [Cas69] and Loxton [Lox72]. 


Remark 2.13 A closely related question to Theorem 2.11 is to classify simple 
abelian varieties A of genus g over Fy for which #A(F,) = #A(F,”) for some 
n > | (meaning that A acquires no new points over Fy”). In [Kad19, Corollary 3.4], 
it is shown (using Theorems 2.8 and 2.9) that this cannot occur for n > 4; for 
n = 3 it can only occur for g = 2, g = 1; and for n = 2 it occurs precisely when 
A is the quadratic isogeny twist (see Definition 3.3) of one of the cases listed in 
Theorem 2.11. 

This in turn implies a corresponding classification for curves over Fg which 
acquire no new points over Fan, using the classification of function fields of class 
number one (see Example 5.16). 


3 Algorithms 


We summarize some of the main algorithms used to compute data about abelian 
varieties in the LMFDB. The code [DKRV20] used is available at 


https://github.com/LMFDB/abvar-fq. 
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3.1 Enumerating Weil Polynomials 


We begin with the algorithm for enumerating Weil g-polynomials of fixed degree 
for a fixed prime power q. This algorithm originated in [AKR10] and was described 
in more detail in [Ked08, Section 5] and [KS16, Section 2], to which the reader is 
referred for more details. 


Remark 3.1 The code for enumerating Weil polynomials was incorporated into 
Sage-9.1.beta0 [S*20] in January 2020. 


As noted earlier, identifying Weil g-polynomials of degree 2g 
P(T) =T78 +a,T78 | +... +a, 
is equivalent to identifying polynomials of degree g with integer coefficients 
Q(T) =T8 +b Te '+.--+b, 
whose roots are all real and belong to the interval [—2,/q, 2,/q], via the relation 


P(T) = T& Q(T +q/T). 


In particular, for any integer i € {1,...,g}, aj,...,a; uniquely determine 
b,,..., bj and vice versa. The strategy is to catalog these polynomials recursively: 
given a putative choice of aj, ..., aj; (or equivalently bj,..., bj;-1), use various 
techniques to impose necessary conditions on a; (or equivalently b;), then recurse 
on the remaining options. Some of these necessary conditions are as follows: 


¢ Rolle’s theorem: the roots of o's —)(T) must all be real and belong to 
[—2,/q, 2,/q]. This can be verified by computing a subresultant (Sturm-Habicht) 
sequence. 

¢ Connectivity: the set of values of b; satisfying the previous condition is either a 
closed interval or empty. 

e Power sums: the i-th power sum of the roots of P(7) must have absolute value 
at most gi! 

* Descartes’s rule of signs: the polynomials Q(T +2,/q) and (—1)§ Q(—2./q—T) 
have all roots real and nonpositive, so their coefficients must be nonnegative. 


¢ Hamburger criterion: for so, 51, ... the sequence of power sums of Q, the Hankel 
matrix 
SO S] S92 eee 
S] S2 S3 eee 


S2 S3 S4 °° 


is nonnegative definite (equivalently, its principal minors are nonnegative). 


Isogeny Classes of Abelian Varieties over Finite Fields in the LMFDB 387 


¢ Hausdorff criterion: for all i, j, the sum of (2,/q — x¥ A/¢ +x)/ as x varies 
over the roots of Q is nonnegative. 


A crucial but counterintuitive point is that while some subsets of these conditions 
together form sufficient constraints to identify Weil g-polynomials, no subset gives 
sufficient “on-line” criteria. That is, they cannot be applied directly to detect whether 
a given sequence aj, ..., ag can be extended to give the coefficients of some Weil 
polynomial; the conditions can only be checked once a full sequence has been 
generated. Therefore we impose “more” conditions than are sufficient to cut down 
the search space as much as possible as early as possible in the computation. 

One serious issue with this calculation is that, while it is trivial to check the output 
for false positives (e.g., Sage has a built-in function is weil polynomial 
to test whether a given integer polynomial is a Weil polynomial), it is rather 
difficult to check for false negatives other than by generating Weil polynomials 
using another method (e.g., by computing zeta functions of abelian varieties) and 
checking for their presence. One useful consistency check is to run the computation 
for g = 1, where the answer is known: by Kronecker’s theorem, the irreducible Weil 
q-polynomials for g = | are precisely the cyclotomic polynomials. One can also 
attempt to use the explicit descriptions of the spaces of Weil polynomials given for 
g = 3 in [Hal10], for g = 4in [HS12], and for g = 5 in [Soh13]. Our results mostly 
matched in the g = 3 case (we found a few extra non-simple examples), while the 
conditions in [HS12, Thm. 1.1] and [Soh13, Thm. 2.1] were not actually satisfied 
by all of the Weil polynomials that we found. 


3.2. Point Counts 


As noted earlier, for A an abelian variety of dimension g, we have 
H'(A) = A‘H'(A) fori = 1,...,2g, so the zeta function of A is 
determined completely by the L-polynomial L(T). In particular, if we write 
L(T) = [78,1 — aT), then 


2g 
#A(F,) = [ [a —a;) = L(1). 


i=1 
Similarly, for any positive integer r, 


2g 
#A(Fyr) = [[a — ai) =Res(L(T), T’ — 1). 


i=l 
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In particular, 


#A(E,2) = L()L(-D. 


3.3. Curve Point Counts 


If A is isogenous to the Jacobian of a curve C, then the sequence cy, := #C(Fq") 
satisfies 


L(T — 
poe) = exp os ly 7 
(l=f)1—@f) (2 
n= 
In fact, whether or not A is isogenous to a Jacobian, the sequence c, consists of 
integers, which we report as the point counts of a “virtual curve” with Jacobian A. 
If the c, violate any known constraints on the point counts of a curve of genus g, 


then A cannot be isogenous to a Jacobian; these include trivial constraints (such as 
cy > Oand cmp => Cy) and some less trivial ones (e.g., the Ihara bound). 


3.4 Base Change, Primitivity and Isogeny Twists 


If A is an abelian variety over F,, then the Weil polynomials for A and for the base 
change of A to Fj, are related as follows. 


Proposition 3.2 Suppose P(T) = Iie (T — aj) is the Weil polynomial associated 
to A/F by the Honda-Tate theorem. Then the Weil polynomial associated to the 
base change of A to Fgr is 


2g 
P,(T) = | [(T — af). 


i=1 


Proof The roots of P(T) are the eigenvalues of the action of the g-Frobenius on 
H(A), and the qg”-Frobenius is just the rth power of the g-Frobenius. Oo 


Definition 3.3 We say that A is primitive if A is not isogenous to the base change of 
any abelian variety defined over a subfield of F,. We say that two abelian varieties 
A and B are isogeny twists if they become isogenous after some finite extension of 
F,. The isogeny twist class of A is the set of isogeny twists of A. 
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Corollary 3.4 [f A and B are simple abelian varieties of dimension g over Fg with 
associated Weil numbers a and B, then A and B are isogeny twists if and only if 
a = CB for some root of unity ¢. 


We use various methods for computing P,(7). The simplest is to just factor 
P(T) approximately over C, raise each root to the rth power and then recognize 
the coefficients of the product as integers. In order to make this approach rigorous, 
one needs to use ball arithmetic in C and increase the precision if there are multiple 
integers within the error bounds for any coefficient. 

The second approach is to symbolically express the coefficients of P,(T) as 
polynomials in the coefficients of P(T) using Newton’s identities to change basis 
between elementary symmetric polynomials and power sums. For small values of 
r the resulting transformation is not difficult to compute but for values of r larger 
than about 10 the memory footprint of the algorithm grows rapidly. We therefore 
only use this approach for smooth values of r, where cached transformations can be 
repeatedly applied to compute the overall base change. 

Finally, the base change can be computed using polynomials over the cyclotomic 
field Q(¢,). In particular, we can use the identity 


r—1 
PP y= [Pen 


i=0 


to determine P, (7). An equivalent approach is to compute P,(T) as the resultant of 
P(U) and U’ —T. 

Since we are enumerating Weil polynomials for many q, we can use these 
methods for computing base changes to easily determine which isogeny classes 
are primitive, and to find primitive models for those which are not. We simply 
compute all base changes from F, to Fy, when both prime powers are contained 
in the database. 

Determining which abelian varieties are isogeny twists of each other is a little 
more difficult. The condition in Corollary 3.4 is difficult to determine directly from 
Weil polynomials. We can improve it slightly as follows. 


Proposition 3.5 Two simple abelian varieties A and B are isogenous over F, if 
and only if there exists a number field K containing Galois conjugates 1', and 1, 
of 1 and 1g such that m',Ox = 1ROK. 


Proof The requirement that 2’, and 2, generate the same ideal is equivalent to 
a’, = us’, for some unit u € Ox. Then u has absolute value | at every finite place, 
and because |,|, = |’ ,lv = ./q for every infinite place v, u has absolute value | 
at every infinite place as well. Therefore u is a root of unity by Kronecker and we 
may invoke Corollary 3.4. oO 


This result is not sufficient for our purposes for two reasons. First, it seems 
to require computations in Galois closures since we need to work with arbitrary 
conjugates of z/, and z,. Second, as with Corollary 3.4, it only applies when both 
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A and B are simple, yet it is possible for a simple abelian variety to be a isogeny 
twist of a non-simple one. 

Instead, we break up the set of isogeny classes for each g and gq into clusters based 
on invariants, then use a pairwise test to further refine these clusters into isogeny 
twist classes. Isogeny twists will have the same slopes and the same geometric 
endomorphism algebra, whose computation we describe in the next section. These 
two invariants are sufficient to divide up isogeny classes into clusters whose size is 
already usually in the single digits, and is at most several hundred for the values of 
g and q that we consider. We then use the following result for each pair of isogeny 
classes in the cluster. 


Proposition 3.6 ((CMSV19, Sec. 7.2]) Suppose A and B are abelian varieties 
over Fy with Weil polynomials P(T) and Q(T). If there is an isogeny from A to 
B defined over Fyr then the cyclotomic polynomial ®,(T) divides the resultant 
Res, (P(z), 2° O(T/2)). 


If we set m to be the least common multiple of the orders of all cyclotomic 
polynomials dividing this resultant, then we can determine whether A and B are 
isogeny twists by computing the base changes P,,(7T) and Q,,(T) and checking for 
equality. Note that to find the product of all cyclotomic polynomials dividing a given 
polynomial, it is not necessary to factor it fully; there is a more efficient algorithm 
of Beukers—Smyth [BS02, §2] that finds cyclotomic factors using the Euclidean 
algorithm. (In Sage, a polynomial over Q has a method cyclotomic part 
implementing this algorithm.) 


3.5 Endomorphism Algebras 


The method for determining the Brauer invariants of the endomorphism algebra 
End°(A/F q) is described in Sect. 2.3; given an irreducible Weil polynomial h 4 these 
invariants give the power e, such that ne is the characteristic polynomial of an 
isogeny class of abelian varieties over a finite field. We can also use Proposition 3.6 
with B = A to find all possible extension degrees where the endomorphism algebra 
might change, and then compute the endomorphism algebra anew for each such base 
change. These two results allow us to, for an isogeny class defined over a finite field 
F,, give the endomorphism algebra of the base change of this isogeny class over 
any finite extension of Fy. 

We refer to the minimal extension over which all endomorphisms are defined as 
the endomorphism field; one can search the database by degree of the endomorphism 
field, which we call the endomorphism degree. 

For the convenience of the reader, we now explain how to describe the endo- 
morphism algebra of a simple isogeny class A over a finite field F, given its 
Brauer invariants and its characteristic polynomial. This is enough to describe the 
endomorphism algebra of any isogeny class, as, if A is isogenous to an abelian 
variety 
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where each A; is simple and A; is not isogenous to A; if i 4 j, then 
End°(A/F,) = Mn, (End(A1/Fq)) x «+» x Mn, (End(A,/F,)). 


Assume now that A is simple. We begin by determining the center of the endomor- 
phism algebra of A: If the characteristic polynomial is hi for h, irreducible, then 
the center of End°(A /F) is the field Q[x]/ha seieiated by 4. We then compute 
the degree of End?(A/F q) Over its center. Its square root is given by the order of the 
class of End°(A /F,) in the Brauer group of its center, by Tate [Tat66, p. 142]. By 
the exact sequence 


0 Br(F4) > G)Br(Fa.v) > Q/Z > 0, 


Vv 


where Fa = Ql[x]//a4, this is the least common denominator of the Brauer 
invariants of “End? (A/F, ), or more simply put, e,. 

To complete our description of End°(A /F,), we note that by Tate [Tat66, 
Theorem 2 and its proof], if Q[x]/h, has a real place, then either Q[x]/h4 = Q 
and End°(A /Fq) is the quaternion algebra over Q that is ramified at p and ov, 
or Qix]/ha = QC /p) and End°(A /F,) is the quaternion algebra over Q(,/p) 
ramified at both real places and nowhere else. 

Otherwise, Q[x]// 4 is totally complex. If e4 = 1, then End?(A/F,) = =Q[x]/ha. 
Otherwise, End°(A /F,) is ramified only at places dividing p, and we identify 
the isomorphism class of End°(A /Fq) by its degree over Q[x]/h, and its Brauer 
invariants at the places above p in Q[x]/ha. 


3.6 Principal Polarizations 


A priori, the isogeny class associated to a Weil polynomial consists of unpolarized 
abelian varieties. In particular, to show how polarizations may vary within an 
isogeny class, we remind the reader of the following standard facts: 


1. For g > 2 and over an algebraically closed field, every abelian variety is 
isogenous to one that is principally polarizable. 

2. For every field, there exist abelian varieties defined over that field which are 
not principally polarizable. For example, over an algebraically closed field of 
characteristic zero, there is a simple proof that every principally polarizable 
abelian variety is isogenous to an abelian variety without a principal polariza- 
tion.” However, this last fact is not true in general, see Example 5.10. 


?See https://mathoverflow.net/questions/16992/non-principally-polarized-complex-abelian-variet- 
ies. 
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3. Polarization also behaves poorly under decomposition into isogeny factors: An 
abelian variety can be principally polarizable without all of its isogeny factors 
being principally polarizable. See [How95, Example 13.8] or 3.2.ac_c_ad, which 
contains a Jacobian whose two-dimensional factor is not principally polarizable. 


The takeaway from these facts is that the quality of admitting a principal 
polarization is not an isogeny invariant. Therefore in the database, when we say that 
an isogeny class is principally polarizable, we mean that there exists some abelian 
variety in the class which is principally polarizable. 

Of the 2,945,722 isogeny classes in the database, there are only 3037 such that 
we cannot determine if the class has a principal polarization. These isogeny classes 
all occur in dimension greater than 3: 358 are in dimension 4, 515 are in dimension 
5, and 2164 are in dimension 6. 

We now discuss methods for determining when an isogeny class is principally 
polarizable. This code was provided by Howe; the version we use is implemented 
inthe has principally polarizable function [DKRV20]. 

For simplicity, the algorithm considers only the case of simple abelian varieties. 
We note that since an isogeny class is principally polarizable when all of its isogeny 
factors are, the database does indicate that a nonsimple isogeny class is principally 
polarizable when all of its factors are. We remind the reader however that, as 
remarked above, the converse is not true, which does mean that certain non-simple 
cases are currently completely out of the reach of our implemented tests, and might 
be principally polarizable even though not all of their factors are. 

Our algorithm depends on the dimension of the isogeny class: In the case g = 1, 
every isogeny class is principally polarizable. In the case g = 2, we apply a result 
of [HMNR0O8], which tests for principal polarizations based on a condition on the 
coefficients of the Weil polynomial. These conditions are that an isogeny class is not 
principally polarizable if and only if ay — a2 = q, a2 < 0, and every prime divisor 
of az is 1 mod 3. 

We now list the collection of tests that are implemented in the database in higher 
dimension. First, if g is odd and A is simple then its isogeny class is principally 
polarizable, which takes care of g = 3 and g = 5. 

In addition, in the ordinary case, we can completely determine whether an 
isogeny class is principally polarizable, using Corollary 11.4 and Proposition 11.5 
of [How95]. We note that the details of the proofs are given in §14 of [How93] for 
readers who would like to see them. The test relies on the fact that in the ordinary 
case, Nx/Q(m —q/z) is always a square and we can consider its positive square root 
N.If q > 2, there is a principally polarized variety in the isogeny class if and only 
if N = (coeff of T*) mod q. If gq = 2, we have a similar congruence condition but 
for a power of 2: there is a principally polarized abelian variety in the isogeny class 
if and only if N = (coeff of T%) mod 2 


Remark 3.7 In [How95], Howe gives an algorithm for determining when an 
ordinary isogeny class (simple or not) contains a principally polarizable variety, 
but the non-simple case has not been implemented yet and we did not implement it 
for the database. 
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Remark 3.8 Howe has explained to us that the congruence conditions come from 
wanting to determine if N, the positive square root of Nx/Q(z + q/z7), is the same 
as the square root specified by certain p-adic conditions. The result is then explained 
by the fact that the middle coefficient of the Weil polynomial is congruent modulo g 
to the square root specified by the p-adic conditions. When g > 2, the congruence 
is enough to compare the signs of the two square roots. However, when g = 2 we 
have | = —1 mod 2, which explains the need to work modulo 4. 


Finally, in even dimension, we verify some conditions on the field K generated 
by the Weil g-number z to allow us to determine if certain isogeny classes are 
principally polarizable (this is [How96, Theorem 1.1]). First, if K is totally real then 
the class is principally polarizable. (Although if g > 4 and A is simple, K is always 
a CM field.) Otherwise, K is a CM field. To set up the notation we will need, let 
P(T) be the characteristic polynomial of an isogeny class of abelian varieties. Write 
P(T) = A(T) for h irreducible; then A is the minimal polynomial of a Weil q- 
number z. Let K = Q(z) and K+ = Q(1+q/z7) be the CM field and its totally real 
subfield, respectively. Then we know that there is a principally polarizable abelian 
variety in the isogeny class associated to P(T) if either K /K is ramified at a finite 
prime, or there is a prime of K+ that divides 7 — q/z and is inert in K/K,. We 
note that this second condition requires some work to test for, and we use the fact 
that a prime is inert in K /K + if and only if the prime ideal is equal to its complex 
conjugate to do so. 


3.7 Jacobian Testing 


Given a characteristic polynomial, to test if there exists a Jacobian with this 
characteristic polynomial, we apply six results from Howe and Lauter’s Magma 
package http://ewhowe.com/Magma/IsogenyClasses.magma. (They were re- 
implemented for the LMFDB by Howe.) This code accompanies the paper [HL 12]; 
see especially Sect.6 of the article for a high-level overview of the software. 
In addition, the comments in the code are excellent, so rather than repeat an 
explanation of the tests, we refer the reader to these two excellent references. We 
note that currently the LMFDB does not implement positive Jacobian testing in 
genus 4 and higher. 


3.8 Angle Rank 


We have two algorithms to compute the angle rank of an isogeny class of abelian 
varieties: one that is numerical, and one that is algebraic and therefore yields a 
provably true answer. In the current version of the LMFDB, the angle rank 5(A) is 
computed numerically using lattice basis reduction. 
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This is done in the following way: We first approximate the roots a; of 
the characteristic polynomial P(T) numerically as pairs of floats (the real and 
imaginary parts of the number). We then pair each root @; to its complex conjugate 
(which is also a root of P(7T)) and retain only one number from each pair of complex 
conjugates, as we know that arg(a;)/27 and arg(@;)/27 have a linear relation over 
Q. Following this, we compute arg(@;)/27 numerically (using the principal branch 
of the logarithm) for the g remaining q; to get values t), ..., tg. Finally, to determine 
the dimension of the Q-span of the appropriate set (the reader can go to Sect. 2.6 
for a reminder of the definition of angle rank), we apply an LLL algorithm with a 
certain precision to the tuple [4,..., tg, 1] (PARI’s lindep). Note that in these 
computations the branch of the logarithm chosen for the computations does not 
matter because we eventually take a Q-vector space span with 1. 

For the sake of completeness, we present below the algebraic algorithm yielding a 
provably true answer as well. Roughly speaking, the algorithm relies on expressing 
the roots of P(T) in a common generating set for some S-units of the splitting 
field of P(T). It would of course have been preferable to use this algorithm in the 
database instead, but at the time this computation was performed for the database 
(2015), the implementation of the software computing S-units in Sage was not fast 
enough to deploy on the full database. See Sect. 6.1 for a possible workaround. 

We now present the algebraic algorithm: Let A be an abelian variety over Fy 
where gq = p“. Let K be the splitting field of P(T) = Pa(T) = Tht — aj). For 
the computation of the angle rank we consider 


T = (a1, 02,...,Q2¢), 


the multiplicative subgroup of K * generated by the roots of P4(T). In order to 
compute 5(A) we use the fact that 


rk(P) = 8(A). 


The first two authors discuss the group I in the sister paper [DKZB21], where some 
explicit relations are computed. Let S = {P € Spec(Ox) : P|p} be the collection 
of primes of K above p. The key observation is that I" is a subgroup of the group of 
S-units of K: 

r< Ox 5: 


Using Sage we can then compute generators* for On s: 


Ons = (6, U4, inngally) 


3The particular function we use was written by Cremona: http://doc.sagemath.org/html/en/ 
reference/number_fields/sage/rings/number_field/unit_group.html. We also remark that there is an 
additional quite large bottleneck in this algorithm due to the need to compute the splitting field of 
the characteristic polynomial. 
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where ¢ is a root of unity generating the torsion part of OF K,s and uj,...,U, are of 
infinite order, and attempt to compute the rank of I in this basis, before Hrocentine: 
however, we eliminate the torsion part of I’, if any. Let m be the cardinality of the 
group of roots of unity in K*. Then to compute the rank of I" it suffices in fact to 
compute the rank of T’” = (aj",..., @5 3g)? since 


tk(T) = 8(Ag,) = 6(AF jm) = rk(P”). 


Here, the equality of angle ranks follows from the fact that 6 computes Q-linear 
relations, and arg(a@)/2m and arg(B)/27 have a Q-linear relation if and only if 
arg(a””)/2z and arg(B’”")/27 do. 

We then write each a” in terms of the generating elements u;, and obtain a 
vector of exponents with integer coefficients. We then form ar x 2g matrix Y whose 
columns are these vectors, and the rank of this matrix gives us our answer: 


rk(Y) = 6(Ap,) +1. 


We note that the explicit relations in mentioned above are derived from the matrix 
Y; several examples of these computations can be found in [DKZB21]. 


4 Statistics vs. Heuristics 


This database naturally invites investigation of the following motivating ques- 
tions: 


¢ How does the number of objects in the database vary with g and g? 

¢ How about if we sort by the Galois group of Q(z), the Newton polygon, or the 
angle rank? 

¢ How is the number of points on the abelian variety distributed? 

e What are the extreme values of the number of points? 


In this section, we gather the data available in the database and informing these 
questions, and compare it to the predictions given by heuristics. 
4.1 The Number of Isogeny Classes 


In order to set bounding boxes for our data collection, we needed to estimate 


N(g,q) = number of isogeny classes of g-dimensional abelian varieties over Fg. 
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We initially chose the limits described in Table | using an incorrect estimate on 
the growth of N(g,q): Believing that N(g,q) grows like g&&*+))/? (rather than 
the correct asymptotic g88+)/4; see below), for each g we included data for g 
with g8&+)/? < 10’ to bound the number of isogeny classes per pair (g, q).* We 
later extended the bounds in order to include more fields of small characteristic in 
dimension up to 3, and to make the range of included q contiguous in dimension 3.° 

We now give a more careful analysis that better models the numerical data that 
we have observed for N(g, q). Since by Honda-Tate these isogeny classes are in 
bijection with characteristic polynomials, one reasonable heuristic is to count Weil 
q-polynomials of degree 2g. This amounts to counting lattice points in the set of 
(a1,..., dg) € R® for which the polynomial 


T8 +aiT&"|+--- +a 
has all roots in the interval [—2,/q, 2,/q]; it is reasonable to approximate this count 


by the volume of the region. This volume has been computed by DiPippo and Howe 
[DH98, Proposition 2.2.1]:° it equals 


8 &§ Ii gtl-i 
ae g(gtl)/4_ 4.1 
(2 I] (5 - :) : on 


i= 


This turns out to be a good prediction in practice; see Table 2 for some examples. 
For the sake of completeness, we note two facts: First, according to [DH98], 
for q large compared to g, the dominant contribution to the count is from ordinary 
abelian varieties. Secondly, to obtain the number of ordinary isogeny classes from 
the formula for the total number of isogeny classes, we simply multiply by a factor 


of g(q)/q. 


Table 2 Predicted versus 
actual values for the number 
of isogeny classes of 
ordinary/arbitrary abelian 
varieties of dimension g 
over Fy 


Ordinary Arbitrary 
q | Predicted | Actual | Predicted | Actual 
25 | 284,444 | 284,740 | 355,556 | 332, 166 
104,025 | 105,600 | 130,032 | 132,839 
170,796 | 171,180 | 256,194 | 267,465 
2 72,362 74,122 | 144,724 | 164,937 


DH) Nn) BR] who 


4We also included (5, 3), where g8(stD/2 ~ 1.4-107, and we only included g up to 500 for g = 1, 
rather than 107. 


5Namely, we added powers of 2, 3,5, 7 up to 1024 for g € {1, 2}, and raised the bound for g = 3 
from 13 to 25. 

®We remark that the bounds used in op. cit. are also related to the original iterator described in 
[Ked08] which some reader might find enlightening. 
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Table 3 Predicted versus a b 
actual east aquates) vanes g_ | Predicted | Actual | Predicted | Actual 
for the equation es : ee : : 
log N(g,q) = alog(q) +b 1 [05 (0.4971 | 1.3863 | 1.3717 | 
2 {15 | 1.4178 | 2.3671 | 2.5302 
3 3 2.9135 | 3.1248 3.2598 
455 4.5452 | 3.7283 4.2707 _ 
5 |7.5 7.2188 | 4.2141 | 4.5660 
40 
35 
30 
25 
20 
15 
G=5 g=3 
10 g=4 
5 =e 
g — 
1 2 3 4 5 


Fig. 1 Plots of (logg, log N(g, q)) for each g and g < 27 in the database. The least square values 
of log N(g, g) = alogg + b are given in Table 3 


Another way to evaluate our prediction for N(g, qg) is to make a least-squares fit 
for values a, b such that log N(g, g) © alogg +b. The results are given in Table 3. 
The log-log plot of gq vs N(g, q) is given in Fig. 1. 

While we do not yet have data for isomorphism classes (see Sect. 6.3), it is worth 
noting that one can derive heuristics for counts of isomorphism classes within an 
isogeny class using Oort’s theory of isogeny leaves in moduli spaces of abelian 
varieties [Oor09]. 


4.2 Galois Groups 


Let P(T) be the characteristic polynomial of an isogeny class of abelian varieties, 
and Q(z)2! be the splitting field of P over Q. How can one expect Gal(Q()2"/Q) 
to be distributed as the isogeny class varies? To answer this question, we need 
to explain Malle’s conjecture [Mal02, Mal04] and the invariant a(G) for a finite 
group G C S,. The exposition given here follows the treatment in [Mal02] (see the 
introduction for example). Given g € S, we define 


index(g) = n — #number of cycles in the cycle decomposition of g. 
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We then define 
(G) = 
OW” ~ minfindex(g):2€G\1}° 

Example 4.1 

1. index((12)(34)) =4-—2=2 

2. a(S4) = 1 

3. a(D4) = 1 

4. a(Ag) = 1/2 

5. a(C4) = 1/2 


Let K be a number field, and 7 be an integer. We define the following counting 
functions for number fields L of degree n over K: 
Nxn(X) =#H{L:[L: K] =n,|Disc(L/K)| < X}, 
Nxn,G(X) =#{L: [L: K] =n, |Disc(L/K)| < X, Gal(L8"/K) = G}. 


First, we have Linnik’s conjecture (c.f. op. cit.) that there exists a constant 
co = co(K, n) such that 


Nx n(X) ~ coX as X > oo. (4.2) 


Next, the (weak) Malle conjecture states that for all ¢ > 0 there exist constants 
C1, C2.¢ such that 


cyX*) << Ne n G(X) < c7,.X*O*, (4.3) 


Assuming these conjectures, given a base field K, we can describe how the 
proportion of extensions with Galois group G should behave on a log-log scale: 


Lemma 4.2 Assuming Eqs. (4.2) and (4.3) we have 


log N xX 
lim log Nx,G,n(X) =a(G). 
X +00 log Nx n(X) 


Proof Assume the Malle and Linnik conjectures. We will prove the upper 
bound and omit the proof of the lower bound as it is similar and easier. Since 
NK n(X) ~ coX, there exists some €;(X), approaching zero as X — ov, such that 
Nx n(X) = coX(1 + €1(X)). Applying Malle’s conjecture, we then have, for all 
e>0, 


log c2,¢ 

log Nx.n,G(X) a(G) + € + oocx) 
ieee) fa es deny ea Oe 

OB NK,n I+ Tos(x) + — log(x) 
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This proves that for all e > 0 


log NxK,n,G(X) 


G ’ 
{ae 


and hence 


log N X 
wag toe K,n,G( ) << a(G). 
X>oo log Nx .n(X) 


A simpler bound using the lower end of Malle’s conjecture proves 


log NV xX 
a(G) < lim og K,n,G ( ) 
X>o0 log Nx n(X) 


which gives the result. Oo 


Because of this result, we might expect that a group G should appear as 
Gal(Q(z)®*'/Q) with frequency such that the log-log ratios have a limiting value 
for each G as gq — co. While this does seem to be the case, the precise value of 
these ratios does not seem to coincide with Malle’s constant a(G), and seems to be 
more complicated. Table 4 shows them for g = 3. If we let @(G) denote these limits, 
to the authors it seems that 


a(6T11)=1, a(6T3) =a(6T6) + 3/5, 1/10 < a(6T1) < 3/10, 


and we have no theoretical explanation for why this is the case. We do note that there 
seems to be some partial progress on these type of conditional Malle distributions 
in [BSMT17]. 


Remark 4.3 Alternatively, it may be the case that these distributions fit closer to a 
“polynomials-in-a-box” distribution. The study of the Galois group of a random 
polynomial goes back to van der Waerden [vdW36], and an excellent summary 
(and further developments) is given in [Zyw1l0]. One can prove (see [BSK20]) 
that the number of monic polynomials of degree d which are irreducible and have 


Table 4 For g = 3 we have plotted log N(g, g, G)/ log N(g, g). Observe the phenomena for 6T1 
at powers of 3. Also, note that none of these can possibly be a(G) for some G, which must be of the 
form 1/n for 1 <n < 2g = 6 (as these numbers are 1.00, 0.50, 0.33, 0.25, 0.20, 0.16) 


Group\q|2—U3 AS 
6T1 0.31636 | 0.11844 | 0.26715)—oo —_| 0.30254 | 0.23744 | 0.14698 | 0.25954) 0.28031 
2T1 0° 00 00 0° 00 0.079146 | —oo —co | oo 


6T3 0.60225 | 0.59221 | 0.66064 | 0.58783 | 0.60854 | 0.54666 | 0.63575 | 0.57555 | 0.60162 
6T11 0.88343 | 0.96285 | 0.97157 | 0.98768 | 0.99256 | 0.99380 | 0.99424 | 0.99707 | 0.99743 
6T6 0.60225 | 0.60257 | 0.58913 | 0.59649 | 0.58877 | 0.60836 | 0.58708 | 0.59022 | 0.59007 
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coefficients in [—L/2, L/2] is asymptotic to L7. Let’s call this number By(L). If 
we let Bg.g(L) be the number of monic irreducible polynomials with coefficients 
in [—L/2, L/2] with Galois group G, then in analogy with Malle one could naively 
guess that for every group G transitive on d elements, there exists some a(G) < 1 
such that for every ¢ > 0, there exist constants c; and c2,- and R > 0 such that for 
every L > R one has 


cy LAO) < Ba,c(L) < ta pL rer), 


If such constants exist, is it the case that a(G) = a(G)? 


4.3 Newton Polygons Data and p-rank Strata 


As discussed in Sect. 2.4, for any given positive integer d, the coarse moduli space 
Ag.d of g-dimensional abelian varieties equipped with a polarization of degree d* 
admits a locally closed stratification by Newton polygons, in which the stratum 
corresponding to an individual polygon is equidimensional of codimension equal 
to the elevation of the polygon. A reasonable guess is that for any given eligible 
Newton polygon P in dimension g, the proportion of isogeny classes of abelian 
varieties over F, with Newton polygons lying on or above P is cq~° where e is the 
elevation of P and c is the number of geometrically irreducible components of the 
stratum over F,. By Theorem 2.5, c = | unless P is the supersingular stratum; in 
the supersingular case the stratum is reducible and not all irreducible components 
may be defined over Fy, but it is guaranteed that c > 0 [Yul7]. One can even give 
an explicit formula for c when g = p [Ibu18, Thm. 4.6]. 

For example, in dimension 3 the Newton polygons are linearly ordered (see 
Fig. A.1). In Fig. 2, we give a plot of 


NG, q, P) 
log, (A) 
NG,q) 

for each of the five possible Newton polygons, where N(g,q, P) is the number 
of isogeny classes with Newton polygon on or above P, and N(g, q) is the total 
number of isogeny classes. Note that the values for g prime agree quite well with 
the discussion above, while for non-prime g there are more isogeny classes in the 
smaller strata than expected. We have no explanation for this behavior. Moreover, 
the supersingular stratum lies consistently above —4 and log, (N(3, q, P)/N(3, q)) 
is increasing with q, suggesting that the number of geometrically irreducible 
components increases as some nonzero power of q. 


Remark 4.4 In dimension 4, every Newton polygon stratum in Aq; occurs for 
the Jacobian of some curve. For strata of codimension at least 2 this is implied 
by Shankar and Tsimerman [ST18, Proposition 2.4]; this leaves only the ordinary 
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Fig. 2 log, (NG, q, P)/N(3, q)) as a function of g, where N(g,q, P) is the number of isogeny 
classes with Newton polygon on or above P. Colors indicate P, with orange the ordinary stratum 
and green the supersingular stratum 


stratum which includes the generic Jacobian, and the almost ordinary stratum which 
includes the generic non-ordinary Jacobian. 


Problem 4.5 The following question is due to Pries (see http://aimpl.org/ 
cohomabelian/): Let p be a prime number. For a fixed dimension g and a fixed 
p-rank f, what is the smallest field of definition of a simple abelian variety over F Pp 
with dimension g and p-rank f? 


For each pair (g, g) for which the LMFDB contains data, we have verified that 
each of 0,..., g occurs as the p-rank of at least one simple abelian variety of 
dimension g over Fy. 


4.4 Frobenius Angle Rank 


Table 5 shows for each g and angle rank 6 with 0 < 6 < g which p-ranks do not 
appear in our database. 
From Table 5 some simple patterns emerge. 


e Angle rank 0 implies p-rank 0. This is known in general because angle rank 0 is 
equivalent to supersingularity. 

¢ Upto g = 4, every positive angle rank can occur for ordinary abelian varieties 
(i.e., for p-rank g). However, please note that for g = 5 we only cover g = 2, 3, 
and for g = 6 we only cover g = 2. 

¢ Ifé =1 then the p-rank is 0 or g. 
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Table 5 Excluded P -tanks Dimension | Angle rank | Forbidden p-ranks 
for simple abelian varieties a a 
1 0 1 
1 0 
2. 0 1,2 
1 0,1 
2 0 
3 0 1,2,3 
1 1,2 
2 0,1,3 
3 none 
4 0 1,2,3,4 
1 1,2,3 
2 1,3 
3 3 
4 none 
5 0 0,1,2,3,4,5 
1 1,2,3,4 
2 0,1,2,3,4,5 
3 0,1,2,3,4,5 
4 1,3,5 
5 none 
6 0 1,2,3,4,5,6 
1 1,2,3,4,5 
2 1,5 
3 1;3,5 
4 0,1,3,5,6 
5 5 
6 none 


¢ For g > 3, if 6 = 2 then the p-rank cannot be 1. 

¢ The p-rank g — | only occurs when 6 = g — 1 or dé = g. This follows from 
a theorem of Lenstra—Zarhin [LZ93, Theorem 5.7], who also show that the case 
6 = g — | cannot occur if g is even [LZ93, Theorem 5.8]. 


For discussions in this direction we refer the reader to the forthcoming paper 
[DKZB21]. 


4.5 Endomorphism Algebras 


We motivate this section with the following open problem: 


Problem 4.6 ([Oor08, Open Problem 22.6]) For each g > 0 and p determine the 
possible endomorphism algebras occurring in that characteristic. 
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As a first step we focus on simple abelian varieties, since the general case is 
expressible in terms of matrix algebras over the endomorphism algebras of the 
simple constituents. We can then break this problem up into two parts: understand- 
ing the center of the endomorphism algebra, and understanding the endomorphism 
algebra as a division algebra over its center. 

The center of the endomorphism algebra of an abelian variety A is just the 
number field Q(z) defined by h4(T), where Pa(T) = ha(T)°. In order to analyze 
the possible centers that can arise, we look at statistics on the discriminant A of 
Q(z). It is useful to normalize the discriminant in two ways. First, we consider the 
root discriminant rd = |A|!/” instead, where n = [Q(zr) : Q]. The root discriminant 
is often more useful when considering number fields of different degrees. Second, 
we use the following result to rescale the root discriminant in a way that allows 
comparison across different values of q. 


Theorem 4.7 Let A be a simple abelian variety of dimension g over Fy with 
associated Weil number x. Then the root discriminant rd of Q(z) is bounded by 


rd < 2g¢q8/”. 


Proof By Mordell [Mor60, Eq. 17], the maximum possible polynomial discrim- 
inant for a polynomial of degree 2g whose roots all have absolute value ./q is 
(2g)?&q8?8—)), Applied to the polynomial 4(T), this yields the bound 


rd < 28 ge/e-1/2, 
e 


This implies the desired bound when e > 1, so we may assume hereafter that e = 1. 
To improve the bound in this case, we distinguish between the discriminant A of 
Q(z) and the discriminant A’ of P4(T); it will suffice to check that the ratio A’/A 
is divisible by g&8—"). 

Put 6 = 2+ q/m, let Ag be the discriminant of Q(f), and let Ab be the 


discriminant of the minimal polynomial of 6 over Q. Let aj,...,a@2,¢ be the 
conjugates of z in Q*8, sorted so that ajO29-; = q fori = 1,..., g. Then on 
one hand, 
A’ : 2 oo oe (“ tj ) (Ce; SS Se 
an (I Sei (ir I a 
g g 8 ; Aw: Ses : feos )\2 
= (foo. arm?) (TF [] (‘es wenten— aletares ale) 
i=! i=l j=i+l (uj + q/oj — &j — q/e4j) 
. \ fr oy (07 07 (ai — aj (aij — G)(ai@j — q)(qau — qaj) : 
= (Fle —aveu?) (Tp Py (eee eer eget?) 
i=l i=l j=i+l a; (i — aj)" (aja; — 4g) 


g 
ay? [ [@ ~q/aj)°. 


i=1 
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On the other hand, the relative discriminant of Q(z) over Q(f) divides the 
polynomial discriminant of x” — Bx + q, which is (7 — q/z)*. Consequently, 


A a - 
a divides | [@ - q/ai). 


i=l 


By writing 


a 3 
Ae (2°) Ab 
A (Aj)? Ao) A 


and noting that Aj is divisible by Ao, we deduce that A’/A is divisible by g8@—)) 
as claimed. Oo 


In Figs. A.2, A.3, A.4, A.5, A.6, and A.7 we give distributions of both the 
polynomial and number field root discriminants for the different values of g in 


the database. In the polynomial case, we divide the root discriminant by 2¢q a 
and in the number field case we divide by 2gq8/*. The distribution in the number 
field case appears to be a sum of copies of the polynomial distribution after further 
rescaling by the appropriate roots of reciprocals of integers. This phenomenon is 
especially apparent in the g = 2 case because of the simple nature of the polynomial 
distribution. Large spikes can be seen at 1/2, 1/3, 1/4 and 1/5 corresponding to an 
extra factor of 2*, 34, 44 or 5* in the index of the maximal order in the equation order 
of the number field; smaller spikes are visible at 1/./2 and 1/./3 corresponding to 
extra factors of 27 and 37. 

Of course, while the endomorphism algebra is usually commutative, sometimes 
it is not. In order to give some insight into the non-commutative cases, we provide 
statistics on the possible Brauer invariants of the endomorphism algebra as a 
division algebra over its center. Table A.10 in the center summarizes the results. 
The length of the sequence of invariants gives the number of places above p in the 
center, and we have collapsed all of the commutative endomorphism algebras into a 
single row for each value of g. 


4.6 Isogeny Sato-Tate Distribution 


What is the distribution of #A (F, )? What about when we restrict to ones with certain 
invariant types? From the Lang-Weil estimates (Sect. 2.7) we know that 

#A(F,) = q° + O(q8—"/?) as gq > oo. 
This asymptotic suggests that the normalized error 


_ #A(F,) — 48 


E: qe 
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will form an interesting probability distribution P,, as we vary over all A’s of a 
fixed dimension g defined over F,. 

Let us consider what happens if we fix g and take a limit as gq —> oo. Writing 
01,..., 2g for the Frobenius eigenvalues with aj@,4; = g fori = 1,...,g, we 
have 


2g 2g 
E=q ® 1] [a-a)—[Jo | = doa a; + og"); 
i=1 i=1 i 


consequently, the distribution of E will have the same limiting behavior as the 
distribution of the normalized Frobenius trace of A. 

The philosophy of Katz—Sarnak [KS99] would predict that the distribution of the 
Frobenius trace should converge to the trace distribution for random matrices in the 
Lie group USp(2g). This convergence holds if we average over isomorphism classes 
of principally polarized abelian varieties, as this forms a geometric family with 
maximal monodromy [KS99, Theorem 11.0.4] to which one may apply Deligne’s 
equidistribution theorem [KS99, Theorem 9.2.6]. 

However, since we do not currently have the data of how many isomorphism 
classes constitute a given isogeny class, we are only able to compute the average 
over isogeny classes. We thus predict a different distribution, given by a function 
whose value at a; is proportional to the measure of the set of (a2, ...,dg) € R® —1 
for which T8 + a;T8—! +--+ + dg has all roots in [—2, 2]. We compute this distri- 
bution using the method of DiPippo—Howe (see Sect. 4.1). By computing Jacobian 
determinants, we see that integrating | over the space of coefficients (a1, ..., dg) is 
the same as integrating |/g! over the space of power sums (pj, ..., Pg), or integrat- 
ing 1/g! times the Vandermonde determinant V(rj,..., rg) = Thetej<g (rj — ri) 
over the space of ordered tuples of roots (71, ..., 7g). That is, the desired distribution 
is given (up to a normalizing factor) by the distribution function 


fs) = | V(ri,....1g)dun, 
SO Ay 


where S denotes the simplex 


S = {ips eR? s 247) S++ Sr, = 2} 


and H, denotes the hyperplane r; + --- +rg = x. Let us write this as an iterated 
integral over r|,...,g—1, Substituting rg = x — rj — +++ —rg_1; the endpoints of 
integration of r; are then 


j-l jvl 


max{rj-1,x—2(g—j)— Dore}, — min{2, |x — Dore | /(g- 7 + D} 
k=1 k=1 
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(writing ro = —2). In particular, the distribution function is continuous, even, and 
piecewise polynomial: on each interval [—2g+4(@ —1), —2g+4i] fori = 1,..., g, 
it is a polynomial of degree (g — 1)(g + 2)/2 with rational coefficients. For 


the extreme values i = | andi = g, this polynomial is a scalar multiple of 
(2g — |x|)8-D@t+2)/2, 
Using Mathematica, we computed the distribution functions f,(x) for g < 4: 


1 
q lx|s<2 
g=1:74 
|x| > 2 


pa2:{7G-bhy bis4 
0 |x| >4 


sis (15|x|4 — 200|x|? + 816) |x| <2 


—3:)3 
8 =3:) 35 (6-|[xI)° 2<|x| <6 
0 |x| > 6 
5(—|x|? —72|x [8 —2304|x|7 +645 12|x|°—516096|x|> +1548288]x|4 —7077888|x |? +24 117248) Ix] <4 
3.227 = 
g=4: =n (8 — |x|)? 4<|x|<8 
0 [x | = 8. 


See Fig.3 for plots of the distribution for g < 4, and Figs. A.8, A.9, and A.10 
for plots of this prediction against our data. A table of moments for g = 3, 4, 5, 6 is 
given in Table 6. 

It is natural to ask about the limit as g — oo. While we do not have a general 
formula for f(x), we can give some evidence that the limiting distribution foo is 
a Gaussian with variance 2, which is also the limit of the Sato-Tate distributions 


hod 
oS 


4 2 


Fig. 3 Isogeny Sato-Tate distribution for g < 4 
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Table 6 Even moments for the isogeny Sato-Tate distributions for g = 3, 4,5, 6 


g |2nd | 4th 6th 8th 10th 12th 14th 
1.7142 | 8.6857 | 71.7575 | 796.1318 | 10750.4655 | 166954.5839 | 2.8786 x 10° 
1.7777 |9.4199 | 82.5201 | 1001.4566 | 15384.2906 | 282674.8553 | 5.9748 x 10° 
1.8181 | 9.8834 | 89.1908 | 1121.6573 | 18035.9973 | 351973.2932 | 8.0435 x 10° 
1.8461 | 10.2041 | 93.7929 | 1203.9623 | 19814.7906 | 397315.2698 | 9.3803 x 10° 


Dim) Bi w 


Table 7 Predicted moments from Eq. (4.4) 


r 4 6 8 10 12 14 
me? 10.2041 | 93.7929 | 1203.9623 | 19814.7906 | 397315.2698 | 9.3803 x 10° 


~(2)\"/2 
(mip?) (r — 1)!!| 10.2243 | 94.3750 | 1219.5797 | 20263.1948 | 411486.7236 | 9.8754 x 10° 


[DS94, Theorem 6].7 Let m&) = f°. x" foo(x)dx and let m§? be a numerical 
approximation to [/ ee x" fo(x)dx. If foo were Gaussian, we would have 


m® = (m@y? or — 1)! (4.4) 


for even r. As Table 7 shows, this identity already holds approximately for g = 6. 
The authors don’t have enough data to determine what happens when we restrict 
to isogeny classes of abelian varieties with a fixed Newton polygon, endomorphism 
algebra, delta rank, or Galois group. We expect similar interesting distributions in 
these cases as g — oo as we normalize the dimensions appropriately. We also 
remark that a determination of the conjectural distribution of point counts also 
remains open (for say, P(T) of a fixed degree 2g and fixed Galois group G). 


4.7 Maximal and Minimal Point Counts 


The question of the maximum number of points on a curve of given genus over 
a given finite field has been studied quite extensively, due to its connection with 
error-correcting codes via the Goppa construction. The web site https://manypoints. 
org tabulates most known results on this question. In this section we investigate 
minimal and maximal point counts of abelian varieties. 

We will say an isogeny class [A] is maximal (resp. minimal) for a fixed g and q 
when #A(F,) is the maximum (resp. minimum) of 


{#B(F,) : B an abelian variety of dim g over F,} 


7It may be feasible to prove that the limiting distribution is Gaussian by bounding the difference in 
moments directly. 
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Table 8 Maximal and minimal abelian varieties (g = 3) 


#A(Fy) Pa(T) Newton Polygon Jac? 
q=3: 343 (77 +37T+ 3)° ss ; No 
1 (7? —3T+ 3)° ss No 
qg=4: 729 (T +2)° ss No 
1 (T — 2° ss No 
q=5 1000 (4ar4sy ord No 
8 (T?—4T +5) ord No 
q=7 2197 (T24+5T+7) ord No 
27 (Fase ord No 
q=8 2744 (Pasrae: ord 272 
64 (T?-5T +8) ord No 
gq =9: 4096 (T +3)° ss 29? 
64 (T —3)° ss No 
q=11: 5832 (7? +6T +11) ord No 
216 (Sori) ord No 


While studying maximal (resp. minimal) abelian varieties is formally similar to 
studying maximal (resp. minimal) curves, it is not directly related: an isogeny class 
being maximal (resp. minimal) has nothing to do with it containing the Jacobian of 
some maximal (resp. minimal) curve. See Example 5.8 for explicit examples. 

In a similar vein one might also naively expect that, since Artin-Schreier curves 
are all maximal and known to be supersingular (see Example 5.5), an abelian variety 
A being maximal (resp. minimal) might imply that A is supersingular. Indeed, in 
some references (such as [KP19]) the terms maximal and minimal are used to refer 
exclusively to supersingular abelian varieties. However, we will see below that a 
maximal (resp. minimal) abelian variety in our sense need not be supersingular. 

When looking at the data our first observation is that for a fixed g,q we 
almost always found that there were unique isogeny classes Amax and Amin with 
L(Amax,T) = L(Amin, —T). This equality implies that Amin and Amax are 
quadratic isogeny twists. 


Example 4.8 When g = 3 and q = 3 we have P(Amin, T) = T° — 9T? + 367? — 
81T>, 10872 — 81T + 27 which is 3.3.aj_bk_add and P(Amax, 7) = T° +9T? + 
36T* + 817? + 1087? + 81T + 27, which is 3.3.j_bk_dd. 


We now explain our findings. In Table 8, we report the unique minimal and 
maximal isogeny classes for g = 3 and 3 < q < 25. These are all isogenous to 
cubes of elliptic curves, and the minimal and maximal examples for a given g are 
isogeny twists of each other; but some are ordinary and some are supersingular. 
We omit g = 2 because it is a bit anomalous: there are 7 minimal isogeny classes 
(and a unique maximal one). See Lemmas 4.9 and 4.10 for an explanation of these 
observations. 
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We now investigate these observations. Recall that the (sharpened) Weil bounds 
on an abelian variety A of dimension g over F, take the form 


[./q —-1)71 < #A,)!/8 < LO/e + D7). (4.5) 


These are precisely the maximal and minimal values appearing in Table 8. We are 
thus led to ask whether equality in the upper (resp. lower) bounds happens only 
for a power of a maximal (resp. minimal) elliptic curve, or equivalently whether 
the inequalities become strict if we restrict to simple abelian varieties of dimension 
greater than 1. This is in fact claimed in both [AHL12, Théoréme 1.1] and [AHL13, 
Corollary 2.2, Corollary 2.14], but we have already seen by an example that this 
is false for g = 2; moreover, Theorem 2.11 implies that there are infinitely many 
g for which there exists a simple abelian variety A of dimension g over F2 with 
#A(F2) = 1. 

On the other hand, by working more closely through the literature, we can recover 
an argument that the inequalities become strict. 


Lemma 4.9 For q = 5,7, assume that #A (F,)!/8 > Ng for every abelian variety 
A of dimension g defined over Fy, where Ns = 2.708 and N7 = 3.970 (reported 
to us by Kadets). Then for all q > 2, the inequalities (4.5) become strict for simple 
abelian varieties of dimension greater than 1. 


Proof First, by inspection of the proof of [AHL13, Proposition 2.17], one deduces 
that the lower bound is strict for g > 8; thus only the cases g = 3, 4, 5, 7 are at issue. 
For g = 3 and q = 4, Theorem 2.9 implies that the lower bound is strict (for g = 4 
we may also apply Theorem 2.8). For g = 5 and g = 7, one can obtain similar lower 
bounds by emulating the calculation used to prove Theorem 2.9. Kadets reports that 
a nonrigorous version of the calculation gives the lower bounds 2.708 for g = 5 and 
3.970 for g = 7, but as of this writing a rigorous calculation remains to be made. 

oO 


We raise our observations to the status of theorems below. 


Lemma 4.10 Under the hypothesis of Lemma 4.9, for every g > 1 and every 
q > 2 there exists unique maximal and minimal isogeny classes of dimension g. 
These classes are quadratic isogeny twists of each other and are a power of the 
unique maximal (resp. minimal) isogeny class of elliptic curves. Finally, the class is 
supersingular or ordinary according to whether p divides |2,/q|. 


Proof By Theorem 2.8, one always gets a maximal (resp. minimal) abelian variety 
of a given dimension over F, by taking a power of the maximal (resp. minimal) 
elliptic curve over Fz. By the discussion following Eq. (4.5), this is unique except 
for the minimal case over F2 in some genera—the isogeny class of an elliptic curve 
is determined by a single point count. In particular, since the minimal and maximal 
elliptic curve over Fy are isogeny twists of each other, the same is true of their 
powers. Oo 
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Table 9 A table of the unique simple-maximal and simple-minimal Weil polynomials for g = 3 


#A(F,) | Pa(T) NP | Jac? 
q=5: | 631 T° +87? +3474 +9373 + 170T? + 200T + 125 ord | Yes 
25 T° —8T? + 32T* — 8573 + 160T2 — 200T + 125 ord | No 
q=T7: |1561 | 7°4+117° 459744 1957? +. 413 T? +539T + 343 ord | 222 
71 T° — 1175 +59T4 — 195 T3 +. 413 T? — 539T + 343 ord | No 
q=8: |2157) |7°4+127° 46974424373 +5527? +768T +512 ord | 22? 
111 T® — 127° +69T4 — 243 T? + 552T? — 768 T +512 ord | No 
q=9: |2911) |7°4 137° 481744 3057? +7297? + 1053 T +729 ord | 22? 
169 T® — 137° +8174 — 305 T3 + 729 T? — 1053 T + 729 ord | No 


q=11:|4861 | 764+ 1575 + 10574 + 439 T3 + 1155 T? + 1815 T + 1331 ord | 222 
323 T® — 15T? + 105 T* — 439 T3 + 1155 T* — 1815 T + 1331 ord | No 
q=13:|7181 | 7%4+ 167° + 12274 +555 T3 + 1586 T?2 + 2704T +2197 ord | 222 
615 T® — 167? + 120T* — 543 T? + 1560 T? — 2704 T +2197 ord | No 
q = 16:| 12649 | T°+19T> + 16674 + 847 T3 + 2656 T? + 4864 T + 4096 ord | 7? 
1189 | T°—19T> + 166 T* — 847 T3 + 2656 T* — 4864T + 4096 ord | No 
q =17:| 14351 | 7°+ 197? + 16974 + 885 T3 + 2873 T2 +5491 T +4913 ord | 7? 
1539 | T°— 19 T? + 167 T4* — 871 T? + 2839 T? — 5491 T +4913 ord | No 
q =19:|19601 | 76421 7° +201 74 + 11197? + 3819 T? + 7581 T + 6859 | ord | 77? 
2113 | T° — 217° +197 T4 — 1085 T? + 3743 T? — 7581 T + 6859 | ord | No 
q = 23: | 32671 | 7°4 247? + 25874 + 1591 T? + 5934 T? + 12696 T + 12167 | ord | 7? 
4049 | T° — 247? +258 T4 — 1591 T3 + 5934 T? — 12696 T + 12167 | ord | ??? 
q = 25: | 40391 | 7° 4257? + 281 74 + 1809 T? + 7025 T? + 15625 T + 15625 | ord | 7? 
5473. | T° — 25T? +281 T4 — 1809 T? + 7025 T? — 15625 T + 15625 | ord | 27? 


Remark 4.11 As asserted in Lemma 4.10, whether or not the maximal and minimal 
elliptic curves over F,, and hence the resulting abelian varieties, are supersingular 
or ordinary depends on whether or not [2,/q| is divisible by p. When g = p, this 
divisibility holds only for p = 2,3 (as otherwise 0 < 2,/p < p). When q isa 
square, so g = p*°, we have \2./q| = 2,/p4 = 2p“ which is obviously divisible 
by p. In other cases, g = p*°t! for some positive integer e and we are asking 
whether the base-p expansion of 2,/p has a zero in the e-th position after the radix 
point; this can occur but is rather sporadic.® 


Following the discussion in Sect. 2.7, we now restrict attention to simple abelian 
varieties. We say that an isogeny class [A] of dimension g over F, is simple-maximal 
(resp. simple-minimal) if #A(F,) is maximal (resp. minimal) among simple abelian 
varieties of dimension g over F,. In Table 9, we report the unique simple-maximal 
and simple-minimal isogeny classes for g = 3 and 5 < q < 25. For each of 
q = 2,3,4, there are 2 simple-minimal isogeny classes (and a unique simple- 
maximal one). 


8The radix point is the analogue of the decimal point for a general base expansion. 
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Table 10 A table of the unique simple-maximal and simple-minimal Weil polynomials for g = 4 


#A(Fy) | Pa(T) NP | Jac? 
q=3: | 979 T8477) +2779 +727? + 14374 + 2167? + 24377 + | ord | 22? 
189T +81 


5 T8671 4+13 T°—10 T°? +T*—3073 +117 T7—162T+81 | ord |No 


q=4: | 2521 T8 4977 +427°% + 1327) 4+ 305 74 +528 T? + 67277 + | ord | 22? 
576T + 256 
29 T8 —9T! +41 7% — 125 T> 4+ 285 T4 — 500 T? + 65672 — | ord | No 
576 T + 256 
q=5: | 5599 TS +1177 +62T° + 229 T? +601 T* + 11457? 4 ord |??? 
1550 T? + 1375 T + 625 
61 T§ —10T7 +45T° — 130 T? +305 T* — 650T? 4 ord | No 
1125 T? — 1250T + 625 


Table 11 Bounds and extreme values for #A (F,)!/ 8 for a simple abelian variety A of dimension g 
over F,. The bounds for q = 2, 3, 4 are taken from [Kad19]; the bounds for g = 5 were computed 
numerically (but not rigorously) by Kadets using the same method (see Lemma 4.9) 


q g Lower bound Minimum Maximum Upper bound 


We make a few curious observations about this data which we are unable to rigor- 
ously explain. For one, all of the simple-maximal and simple-minimal examples are 
ordinary. For another, the simple-maximal and simple-minimal varieties are most 
often isogeny twists of each other, but not always (see g = 5, 13, 17, 19); in any 
case, they have opposite sign patterns (the coefficients of P4(T) are positive for A 
simple-maximal and alternate in sign for A simple-minimal). Finally, note that the 
simple-maximal variety for g = 5 is a Jacobian. 

In Table 10, we give similar information for g = 4, and in Table 11 we compare 
the extreme values of #A (F,)!/ 8 to the bounds given in Theorem 2.9. 


5 An Isogeny Class Scavenger Hunt 


In this section, we describe a number of examples related to questions or results in 
the literature. 

Several of these examples involve Jacobians of curves, whereas the LMFDB does 
not currently contain complete information about Jacobians of curves of genus at 
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least 4 (Sect.6.2). To generate these examples, we used Sage [S*20] to exhaust 
over hyperelliptic curves, computing zeta functions until we found one of the desired 
form. (This can also be done in Magma [BCP97].) 


5.1 Some Basic Examples 


Example 5.1 (Supersingular Elliptic Curves) One of the first interesting exam- 
ples of abelian varieties are supersingular elliptic curves defined over finite fields. If 
E is an elliptic curve defined over a finite field, we have 


2 
c(E/E,,T) = +47 
== ar) 
with a € Z and |a| < 2,/q by the Hasse-Weil bound. In addition, if E is 
supersingular then a = 0 mod p, for p the characteristic of F,. Therefore if p 
is prime and p > 5, then there is a unique isogeny class of supersingular elliptic 
curves over F',, namely the one with a = 0. This means that 


Pa(T) =T*+q=(T -ifgyT +iv@: 


a similar phenomenon holds in more generality. Indeed, when A/F, is a simple 
abelian variety, A is supersingular if and only if 74 = ¢./q for ¢ a root of unity: 
namely, if A is supersingular, then 2/,/q is an algebraic number with norm | in 
every finite and infinite place, so by Kronecker—Weber it is a root of unity. 

In contrast, while an elliptic curve defined over a finite field has p-rank zero 
if and only if it is supersingular, this does not hold in higher dimension, even for 
simple abelian varieties. For example, isogeny class 3.2.ac_c_ac is geometrically 
simple of p-rank 0, but has Newton polygon slopes [1/3, 1/3, 1/3, 2/3, 2/3, 2/3] 
and therefore is not supersingular. (This is the smallest dimension for which p-rank 
0 does not imply supersingular.) Conversely, a supersingular isogeny class always 
has p-rank 0. 

We finally comment on the endomorphism rings of supersingular elliptic curves. 
While every supersingular elliptic curve has geometric endomorphism ring a 
maximal order in a quaternion algebra, it is not the case that every endomorphism 
is defined over the field of definition of the elliptic curve. For example, there are 
supersingular elliptic curves defined over F, for each prime p (belonging to the 
isogeny class with a = 0, as above), but each of them has endomorphism ring 
isomorphic to an order in an imaginary quadratic field and only acquires its extra 
endomorphisms over an extension of F,. For example, isogeny class 1.3.ad has 
endomorphism degree 6, by which we mean that its geometric endomorphisms are 
defined over F36; isogeny class 1.2.ac has endomorphism degree 4; and isogeny 
class 1.2.a has endomorphism degree 2. 
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Example 5.2 We now give examples of particularly uncomplicated Weil polyno- 
mials for g = 2 and g = 4. These polynomials can be useful to check conjectures 
when one wants to produce examples quickly by hand. We first observe that by 
Lemma 2.2 a Weil polynomial P4(7) = P(T) of an abelian variety A of dimension 
g can be written as 


8 
P(T)=|]@?-B T+ (5.1) 
i=1 
where 6B; = a; + q/a; is totally real, and a,...,@2g are the roots of the Weil 
polynomial, arranged so that ojo2,-; = q fori=1,..., g. 


When g = 2, we can expand this to obtain: 
P(T) = T* — (Bi + B2)T? + (2g + BiBa)T* — (Bi + fr)qT + 4°. 
For r a positive integer, choosing Bj = ./r and B) = —./r gives a nice family of 
examples. In fact, for any positive integer a not divisible by p such that 2q > |al, 
letting r = 2q — a gives the characteristic polynomial 


P(T)=T*+aT’?+¢@° 


for an abelian surface. (Note that we require p { a to ensure that e, = 1; see 
Sect. 2.3.) This isogeny class has Weil g-numbers 


(a ae ee = 
) - 2 2 2 


Even more concretely, for g = 5 any a € {—9, —8,..., 8, 9} is admissible. In the 
case a = —| this is 2.5.a_ab. 
One can generalize the above example by considering (5.1) where 6), ..., Bg are 


totally real algebraic integers which form a union of Galois orbits over Q and q is a 
sufficiently large prime power. 

Another pleasing family of isogeny classes can be obtained by considering the 
case of g = 4 with two isogeny factors of dimension 2 each with Weil polynomial of 
the form T++a;T*+q? as above. In this case this isogeny class has Weil polynomial 


(T44a,T?+q?)(T4 +027? +q7)=T8 + (ay tar) To + (2g? +a1a2)T*+ (ai tar)q?T+q4, 


which we can specialize to aj = c+ bs/d and az = c — bVd, where aj and a are 
algebraic integers, and as before 2g > |a;| for each i, to give 


a, +a) = 2c, ajay = c* — db’, 
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and 


P(T) = T® + 2cT® + (2g? +c? —db*)T* + 2cq’T +q°. 


In the special case c = 1,b = 2,d = 2,q =5S this is 4.5.a_c_a_br. 


Example 5.3 ([AP11, Example 2.1] and 4.3.a_a_a_g) Consider the hyperelliptic 
curve given by 


C: yee ae $l bo 4 oe a ee. 
The genus of this curve is 4 and 
L(C/F3, T) = 81T®8 + 6T* + 1. 
This example was computed by writing down the shape of the zeta function and 


equating terms in the truncated Taylor series. 


Example 5.4 ([Voi05, Example 2.3] and 3.2.a_a_f ) The isogeny class of the 
Jacobian of a curve C over Fy can be identified by knowing #C(F,) for 1 <r < g. 
In the examples where C is a projective curve defined by x7 y + x*z + y*z = Oor 
xy + y3z+-xz3 = 0 over Fp, C has genus 3 and we can compute point counts “by 
hand” using a presentation of the field: 


r 12,3) 4 
#C (For) |/3|4 24|17 


This allows us to solve explicitly for the L-polynomial (see op. cit.) 
LO bro 4 Sr 1, 


Example 5.5 Artin-Schreier curves are fan favorites. An affine Artin-Schreier 
curve has a model 


Upg: y? —y= fax) 


where fa(x) € F,[x] is a polynomial of degree d. After a desingularization of the 
naive projective model one gets a proper model X = X f,q. In the case where q = p, 
the genus of this curve is g = (d — 1)(p — 1)/2, and for d = 5 and p = 3 the curve 
has genus 4 so its Jacobian will be in our database. How can we find it? It turns out 
that the point counts of such a curve can be explicitly computed: 


#Xpg(Fqn)=1+] D> SG,n)] +4". 
veF,\(1) 
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In the above expression the y’s are additive characters and the S’s are character 
sums given by 


n-1 ; 
Sn) = YO vty n(fO)), — Ty n(@ = Dat. 
beF gn i=0 


All of the additive characters of F, are parametrized by a € F, and take the form 
Xa: Fy — C where 


aTtRy /F y (d) 
Xalb) = op moe ek 


For details on this computation see [Mue13, §2.1 and §2.2]. 

In the special case where f (x) = x>, we computed the values of these characters 
using Sage, and found that #X(F3.) = 4, 10, 28, 154, 244 forn = 1,2,3,4,5. 
Searching our database gives 4.3.a_a_a_s, as the isogeny class of the Jacobian. This 


isogeny class contains supersingular abelian varieties, as expected. 

In the special case f(x) = x + 2x + 1, we again compute 
#X (F3x) = 1, 7,55, 91, 271 forn = 1, 2,3, 4,5, and find that the Jacobian belongs 
to 4.3.ad_d_j_abb, another non-simple supersingular isogeny class; this time the 


isogeny factors are not isogenous to each other. 


Example 5.6 The isogeny class 2.2.a_ad provides an example of a principally 
polarizable isogeny class of abelian varieties of dimension 2 over F2 which does 
not contain a Jacobian (Example 5.21 is another example in higher dimension). 

Also, as discussed previously, there exist principally polarizable abelian varieties 
which have isogeny factors whose isogeny classes are not principally polarizable. 
For instance, the g = 4, g = 5 isogeny class 4.5.ak_bp_adq_hc has a g = 2 isogeny 
factor 2.5.ac_ab which contains no principally polarizable abelian surface. 

For interested readers, we point out that our methods don’t allows us to determine 
if the particular g = 4 simple isogeny class 4.5.ag_o_au_bj contains a principally 
polarizable abelian variety. We are unable to do so whenever the isogeny class is 
not ordinary, and its associated CM field K = Q(z) (which in this particular case 
is 8.0.268960000.3) is unramified over its totally real subfield K+, and every prime 
of K+ dividing w — q/z is not inert in K/K +. In short, the condition in [How96, 
Theorem 1.1] summarized in Sect.3.6 does not completely answer the question: 
“Does this non-ordinary isogeny class contain a principally polarizable variety?” 


Remark 5.7 Howe points out to us that the 2-dimensional isogeny class 
2.2.a_ad presented in the above example is one from a family that is in the 
Appendix to [MN02]. The theorem in op. cit. is that no polynomial of the form 
T* + (1 — 2q)T? + q? is the Weil polynomial of a Jacobian of a curve over Fy. 
One can also see that such an isogeny class contains a principally polarized abelian 
variety. It is the restriction of scalars from F,2 to Fg of an elliptic curve with Weil 


polynomial T? + (1 — 2q)T + q?. 
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Example 5.8 In this example we show that maximality or minimality of a curve has 
nothing to do with the associated Jacobian being maximal or minimal as an abelian 
variety in the sense of Sect. 4.7. Indeed, the isogeny class 3.3.aj_bk_add is minimal 
for g = 3 and g = 3 with #A(F3) = 1, but it does not contain a Jacobian since its 
virtual curve count has #C(F3) = —5. Therefore, a minimal curve of genus 3 over 
F3 cannot have minimal Jacobian. 


Example 5.9 When g = 6 and g = 2 there exist isogeny twists 6.2.a_ac_a_c_aa 
nant). The two number fields have a different number of places above 2 (3 in one 
case and 4 in the other), so the Brauer invariants of their endomorphism algebras are 
different: (0, 0, 0) vs (0, 0, 0, 0). 


Example 5.10 (Communicated to us by Howe) Consider isogeny class 2.2.b_b, 
which has characteristic polynomial P(T) = T+ + T? + T? + 2T +4. The ring 
Z[z, q/m] is a maximal order in a field with class number one, and therefore this 
isogeny class contains a single isomorphism class of abelian varieties. Furthermore, 
this abelian variety must have a principal polarization (by [How95] or by the more 
general Theorem 1| of [How96]). This shows that while over an algebraically closed 
field of characteristic 0 every abelian variety is isogenous to an abelian variety that 
does not admit a principal polarization, this is not true in general (cf. Sect. 3.6). 


5.2 Supersingular Curves 


It is not clear whether every eligible Newton polygon can occur for curves of a given 
characteristic (see Remark 5.13), and the following is a related long-standing open 
problem. 


Question 1 For every prime p and every positive integer g, does there exist a genus 
g curve over F,, whose Jacobian is supersingular? 


This is known for g < 4 [KHS19]. It is also known for all g when p = 2 
[vdGvdV95]; however, these curves cannot always be taken to be hyperelliptic 
[SZ02]. Some higher-genus cases are treated in [LMPT18, LMPT19]; however, 
the following example is not covered by those papers (nor by [vdGvdV95], which 
handles some genera for p > 2). 


Example 5.11 Let C be the hyperelliptic curve over F3 given by 


y- ag ee a ge ee: 
Then 


L(C/iy, TY=1 437? 4+ gir? 4 2437, 
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so the Jacobian A of C is supersingular and belongs to isogeny class 5.3.a_d_a_a_a. 


Remark 5.12 One can also consider curves whose Jacobians are superspecial, 
meaning that they are isomorphic (not just isogenous) to a product of supersingular 
elliptic curves. Ekedahl [Eke87] showed there do not exist superspecial genus 4 
curves in characteristics 2 and 3, and asked whether conversely they exist in all 
characteristics at least 5; this question was answered negatively by Kudo—Harashita 
[KH17], who showed that none exist in characteristic 7. 


Remark 5.13 The moduli space M, of curves of genus g has dimension 3g — 3, 
which is much smaller than the length of a maximal chain of inclusions of Newton 
strata in Ag, which is asymptotic to g”/4 (see Corollary 2.4). However, unlike Ag, 
which has irreducible strata except for the supersingular case, Newton strata in Mg 
can be much more reducible. While it is conceivable that there are enough strata in 
Meg to account for all possible Newton polygons, it does not seem likely. 


5.3 Ordinarity and Angle Ranks 


The following is a conjecture of Ahmadi—Shparlinski. 


Conjecture 5.14 ([AS10, §5]) Every ordinary geometrically simple Jacobian has 
maximal angle rank. 


This is verified in the database in dimensions 2 and 3. This is a theorem in 
dimension 2, even without the ordinary condition: [AS10, Theorem 2]. It is also 
a theorem in dimension 3, but this time it requires the ordinary condition: [Zar15, 
Theorem 1.1]. 

We verified the conjecture in dimension 4 over F2 as follows. According to the 
LMFDB, there are 52 isogeny classes of ordinary, geometrically simple abelian 
varieties with angle rank at most 3 (in fact they are all equal to 3). Since at the time 
of verification the LMFDB did not yet contain full data about whether an isogeny 
class in dimension greater than 3 contains a Jacobian, we used the fact that every 
nonhyperelliptic genus 4 curve is the intersection of a quadric and a cubic in P? to 
compute the zeta functions of all genus 4 curves over F2. We found 620 distinct zeta 
functions, none of which occur among the previous list of 52. (This result has been 
independently confirmed by Xarles [Xar20], and the corresponding data added to 
the LMFDB.) 

As an aside, note that [Zar15, Theorem 1.1] implies that for each of the 52 
isogeny classes in the previous paragraph, the endomorphism algebra must contain 
an imaginary quadratic field, which we have confirmed. 

By contrast, the conjecture fails in dimension 4 over F3 and Fs, as shown by 
the curves given in Example 5.15. We also note in passing that Conjecture 5.14 is 
incompatible with Conjecture 5.20 below. 
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Example 5.15 Let C be the hyperelliptic curve over F3 given by 
y? ay Barge! or x, 
Then 
MCh Mela Tra ar Sar 0 1279? 4 18 = 29T 8 1r", 


so the Jacobian A of C belongs to isogeny class 4.3.ab_c_ae_ac. We see that A is 

ordinary, geometrically simple, and has angle rank 3. It thus constitutes a counterex- 

ample to the Ahmadi—Shparlinski conjecture (Conjecture 5.14). Consistently with 

Zarhin’s theorem, the endomorphism algebra contains the field Q(./—7). 
Similarly, let C be the hyperelliptic curve over F5 given by 


4 =e? +3943 4k, 
Then 
CMs Tra VoT 4 or" Aree 16r* = 20r° 4 50r* = 1447’ 4 6257". 


so the Jacobian A of C belongs to isogeny class 4.5.ab_c_ae_q. Again, A is ordinary, 
geometrically simple, and has angle rank 3. The endomorphism algebra contains the 


field Q(./—15). 


5.4 Function Fields of Class Number One 


Example 5.16 ({Stil4] and 4.2.ad_c_a_b) The following is the LMFDB annota- 
tion for 4.2.ad_c_a_b. In [Stil4], Stirpe exhibited an example of a genus 4 curve 


C/F> for which 
LOC/Es, Tym) SBT 20 eT + BT KH aT” 4. 167". 


The Jacobian A of this curve belongs to isogeny class 4.2.ad_c_a_b. 

This example is notable because C has the largest possible genus among curves 
over finite fields with trivial class group, and because it refuted a published result 
from almost 40 years earlier. In [LMQ75], it was shown (correctly) that there are 
seven such curves of genus at most 3 and at most one of genus 4; it was also 
claimed (incorrectly) that the genus 4 case could be ruled out. Correct proofs of 
the classification can be found in [MS15] and [SS15]. 
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5.5 Hypersymmetric Abelian Varieties 


Let A be an abelian variety over a field k D F,. Following Chai—Oort [CO06, 
Definition 2.1], we say that A is hypersymmetric if 


End(Ap)z,, = End(Az[p®)). 


These are meant to provide a positive-characteristic analogue of CM points in the 
moduli space of abelian varieties. 

According to [CO06, Theorem 3.3], for a simple abelian variety, one can read off 
whether it is hypersymmetric explicitly from the Frobenius polynomial. 

Here is an explicit example. 


Example 5.17 We exhibit a simple hypersymmetric abelian threefold over Fg by 
verifying that 3.8.ag_bk_aea satisfies the conditions of [CO06, Conclusion 3.6]. The 
Newton polygon has slopes 1/3 and 2/3 each with multiplicity 3, so it is balanced 
in the sense of [CO06, Definition 3.4]. The prime 2 splits completely in Q(z) = 
Q(./—7), and the Brauer invariants of the endomorphism algebra at the places above 
2 are again 1/3 and 2/3. 

The same conditions are satisfied by the quadratic isogeny twist 3.8.g_bk_ea. We 
have checked that other than elliptic curves (which are all hypersymmetric), these 
are the only examples of hypersymmetric abelian varieties currently found in the 
LMFDB. 


5.6 Isomorphic Endomorphism Algebras and Different 
p-ranks 


Example 5.18 This example is a modified version of [Gon98, Example 4.2]. (That 
example starts over F3 rather than F2, but we do not currently have abelian 
threefolds over F27 in the LMFDB.) 

Let A be an abelian threefold over F2 in the isogeny class 3.2.ad_c_b. Then A is 
simple, ordinary, and of p-rank 3, and its endomorphism algebra is Q(¢7). Although 
A is not geometrically simple, its base change to Fs, which belongs to the isogeny 
class 3.8.ag_bd_adf, is again simple. 

Let B be an abelian threefold over Fg in the isogeny class 3.8.ag_i_i; the Weil 
number for B is twice that for A. Here B is geometrically simple of p-rank 0, and 
its endomorphism algebra is again Q(¢7). 


Example 5.19 There are also examples where both A and B are geometrically 
simple and have the same endomorphism algebra but different p-ranks. For 
example, abelian varieties in the isogeny class 3.2.a_c_c have p-rank 0 while abelian 
varieties in the isogeny class 3.2.d_f_h have p-rank 3; both have endomorphism 
algebra isomorphic to the number field 6.0.679024.1. 
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5.7. Abelian Fourfolds as Jacobians 


For g > 4, a generic principally polarized abelian variety is not isomorphic to a 
Jacobian for dimension reasons. However, when working up to isogeny, it becomes 
much less clear what to expect. Using ideas from the theory of unlikely intersections, 
Shankar—Tsimerman [ST18] have made numerous observations about this question, 
including the following conjecture. 


Conjecture 5.20 ([ST18, Conjecture 2.5]) Every 4-dimensional abelian variety 
over F’, is isogenous to the Jacobian of some (possibly reducible) stable curve. 


Notably, however, it is not predicted that a 4-dimensional abelian variety over 
F;, is isogenous to the Jacobian of a curve over F,. This can fail to occur, as in the 
following example. 


Example 5.21 Consider the isogeny class 4.2.c_c_ac_af over F. This class con- 
tains a principally polarized abelian fourfold A which is not isogenous to a Jacobian: 
if it were, the corresponding curve would have a negative number of F:-points (see 
Sect. 3.3). However, the base change of A to Fy = F2[a], which belongs to the 
isogeny class 4.4.a_c_i_j, is isogenous to the Jacobian of the hyperelliptic curve 


ytaxt+x4x%74+x4+a)y =x? 42°. 


We can also stress-test the conjecture by considering the following example. 


Example 5.22 Consider the isogeny class 4.2.a_a_b_af over F2. This isogeny class 
is geometrically simple and ordinary. It cannot contain a Jacobian over any subfield 
of Fi¢: if it did, the corresponding curve would have a negative number of Fj¢- 
points. In addition, Howe’s criterion (see Sect. 3.6) implies that this isogeny class 
cannot contain a principally polarized abelian fourfold over Fs for any odd integer 
g. Consequently, if Conjecture 5.20 holds, then this isogeny class contains a 
Jacobian over F2¢ for some even integer g > 6. 


5.8 Distinguishing Isogeny Classes by Point Counts 


The Weil polynomial of an abelian variety of dimension g has g unknown 
coefficients, so it is expected that these can be solved for using g point counts. It 
turns out that we can often do better than this. This exotic phenomenon is governed 
by information-theoretic heuristics (see Remark 5.25). 


Example 5.23 For A a 5-dimensional abelian variety over F2, A is determined up 


to isogeny by the tuple (#A(F,;) : i = 1,..., 4). This is best possible: for example, 
an abelian variety A in any of the isogeny classes 


5.2.ab_b_c_d_ac, 5.2.ab_c_a_a_i, 5.2.ab_c_b_a_d 
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satisfies (#HA(F2), #A(F4), #A(Fg)) = (42, 2520, 80262). By contrast, over F3, a 
5-dimensional abelian variety is determined up to isogeny by the tuple (#A(F3;) : 
b= 14.2853). 


Example 5.24 Similarly, for A a 6-dimensional abelian variety over F2, A is 
determined up to isogeny by the tuple (#A(F>:) : i = 1,..., 4). This is best possible 
even if we restrict to simple abelian varieties: for example, an abelian variety A in 
any of the isogeny classes 


6.2.ab_ab_b_g_ab_aj, 6.2.ab_b_ab_ac_f_ad, 6.2.ab_c_ab_ad_f_ap 


is geometrically simple and satisfies (#A(F2), #A(F4), #A(Fs))=(42, 4032, 
246078). 


Remark 5.25 It is not known exactly how many initial point counts are needed 
to identify a g-dimensional abelian variety over F, up to isogeny (for known g 
and q). The fact that the complete sequence of point counts determines the Weil 
polynomial is already nontrivial; it follows from a theorem of Fried [Fri88] (see also 
[Hil05]). Using the Weil bounds, it is shown in [Ked06b] that at most max{18, 2g} 
counts suffice; however, Noam Elkies has pointed out that on information-theoretic 
grounds, one should expect the number of counts needed to be about g/2, and 
indeed this is consistent with these examples. Namely, we need to distinguish 
among O(q8&t+)/4) Weil polynomials (Sect. 4.1), whereas the Lang-Weil bound 
(Sect. 2.7) implies that the number of possible values for the tuple (#A (Fi ))j_ , is 


O(q"’ 2), 


6 Possible Generalizations and Bottlenecks 


We conclude with some discussion about possible future directions for this work. 


6.1 Bottlenecks 


We first identify some bottleneck steps that limited our original work, and which we 
would like to overcome. 

As described in Sect. 3.8, we currently present Frobenius angle ranks which were 
computed nonrigorously using floating-point arithmetic, because it is not feasible to 
run the rigorous algorithm on all cases in the database. However, since we also 
compute Galois groups, we can use those to certify some cases as having maximal 
angle rank. This covers the vast majority of cases, which might make it feasible 
to run the rigorous algorithm on the rest, but our present methods rely on the 
computation of a splitting field, which is also very costly. 
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As has come up on several occasions already, we do not currently implement 
positive Jacobian testing for abelian varieties of dimension greater than or equal to 
4. See Sect. 6.2 for further discussion. 

One obstruction to adding complete tables of abelian varieties for other pairs 
(g, g) is the overall size of the dataset. Using Eq. (4.1), including (7, 2) would add 
about 2.2 million isogeny classes, (6, 3) about 10 million, (5, 4) about 2.2 million, 
(4, 7) about 700,000, and (3,27) about 450,000. For comparison, the database 
currently contains about 3 million isogeny classes and takes up about 10GB. It 
would certainly be feasible to extend, but a line needs to be drawn somewhere. 
Moreover, the computational time required per isogeny grows quickly with g, so 
adding data with g > 5 takes more effort than suggested by the number of classes 
alone. 

In lieu of enlarging the tables, one could also implement the computation of the 
data presented in LMFDB on a case-by-case basis for individual isogeny classes. 
One piece of data that would be difficult to compute in this way is the isogeny twists, 
which we currently do by finding hash collisions across the entire table (Sect. 3.4). 

It would also be useful to have a mechanism to produce random elements of the 
set of isogeny classes for large g and q. It should be possible to effectively simulate 
the uniform distribution on isogeny classes by computing analogues of the isogeny 
Sato-Tate distribution in which one projects onto the first k polynomial coefficients 
(the isogeny Sato-Tate distribution being the case k = 1). An alternative approach 
may also be to take the existing Weil polynomial iterator discussed in Remark 3.1 
and replace iterations over integers in an interval with uniform random samples. 


6.2 Jacobians 


Currently the LMFDB does not identify any isogeny class of abelian varieties of 
dimension at least 4 as containing a Jacobian. For dimension 4, it may be possible 
to exhaust over isomorphism classes of genus 4 curves using the fact that every 
such curve is either hyperelliptic or the transverse intersection of a quadric and a 
cubic surface in P*; see [Sav03] for a similar calculation. For curves of genus 5, 
the analogous assertion is that every such curve is either hyperelliptic, trigonal, 
or the transverse intersection of three quadrics in P*. For curves of genus 6, 
the analogous assertion is that every such curve is either hyperelliptic, trigonal, 
bielliptic, isomorphic to a smooth plane quintic, or birational to a plane sextic with 
four double points [ACGH85, Exercises V.A]. (Beware that the previous assertions 
are made over an algebraically closed field.) 
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6.3 Isomorphism Classes 


Beyond the current data in the LMFDB, it would be extremely desirable to tabulate 
abelian varieties up to isomorphism, not just up to isogeny. The easiest cases for 
this are those of ordinary abelian varieties, for which an explicit description of 
isomorphism classes within an isogeny class has been given by Deligne [Del69]; 
abelian varieties over F,, for which a similar description has been given by 
Centeleghe-Stix [CS15]; and almost ordinary abelian varieties [OS20]. The recent 
work of Marseglia [Mar19] has made great strides towards making these methods 
practical at the scale of the LMFDB, and we plan on working with him on including 
data on isomorphism classes into the database. 

In order to handle nonordinary abelian varieties over nonprime fields, it is 
probably necessary to go back to the proof of Honda’s theorem, by constructing 
CM abelian varieties over number fields and then reducing them modulo primes. 
Note that whereas Honda’s original approach to this in [Hon68] used complex 
uniformization and GAGA, a more recent construction of Chai—Oort [CO15] gives 
a more algebraic approach that might be easier to implement as an algorithm. 


6.4 K3 Surfaces and Higher Weight 


It would be natural to attempt a similar compilation of other types of algebraic 
varieties over finite fields and their zeta functions. A strong candidate class for this is 
K3 surfaces, for which a weak version of the Honda-Tate theorem is known [Tae16, 
Ito19]. More precisely, for a given candidate Weil polynomial in this setting, one can 
prove there exists a corresponding K3 surface in some base change of this isogeny 
class. The code for tabulating Weil polynomials described in Sect. 3.1 can produce 
lists of possible zeta functons for K3 surfaces over Fg for small g (this has been 
tested up tog = 5). 

This suggests the question of trying to determine exactly which zeta functions 
occur for K3 surfaces over a given field. An indication of the difficulties involved 
can be seen in [KS16], where a complete tabulation of smooth quartic surfaces in 
P> over F> and their zeta functions was made (computing the latter by enumerating 
points); while this search did realize every eligible zeta function that could not occur 
for any other type of K3 surface, not every zeta function that could have appeared did 
so. Furthermore, certain zeta functions can only appear for K3 surfaces of very large 
degree, which would be very difficult to write down explicitly (the moduli space 
of K3 surfaces of a given degree becomes increasingly hyperbolic as the degree 
increases). 

A closely related case is that of cubic fourfolds. Some examples of zeta function 
computations to resolve specific existence questions for cubic fourfolds can be 
found in [AA18] and [CHK19]. 

In another direction, one can also identify a class of surfaces with small invariants 
(genus, irregularity, K~); identify all Weil polynomials over F, (for some given 
small values of g) which could arise from a surface with the given invariants; then 
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exhaust over the surfaces in question to see which polynomials arise and what 
“isogeny classes” they fall into. (In general it is not clear what geometric conditions 
correspond to an equality of Weil polynomials. One nontrivial example is that 
surfaces which are derived equivalent, meaning that they have isomorphic bounded 
derived categories of coherent sheaves, have the same zeta function [Hon15].) In 
some cases, candidate polynomials can be ruled out because they would predict 
impossible point counts on the underlying variety; for example, this happens for K3 
surfaces over F2 as shown in [KS16]. 


A Tables and Figures 


Table A.1 shows how the number of distinct Galois groups and Newton polygons 
varies with dimension for simple isogeny classes; see Sects. 2.4, 2.5, 4.2 and 4.3 for 
further discussion. Table A.2 gives counts of the number of simple isogeny classes 
for each g < 5 with every possible Galois group; see Sect. 4.2 for the definitions 
of a(G) and a(G) (maxp refers to the maximum gq for which we have data). Table 
A.3 gives counts of the number of simple isogeny classes for each gq < 5 with every 
possible Newton polygon; see Sect. 4.3 for a discussion of how we expect these 
counts to grow as a function of g and the elevation of the Newton polygon, and 
note that the supersingular Newton polygon does not occur among simple isogeny 
classes with g = 5 and gq < 3. Tables A.4—A.9 give the counts of the number of 
simple isogeny classes grouped by g, g, Newton polygon, Galois group and angle 
rank; see Sect. 4.4 for further discussion of the relationship between angle rank and 
the Newton polygon. Table A.10 describes the possible Brauer invariants that can 
occur for endomorphism algebras of simple isogeny classes; see Sect. 4.4 and Figs. 
A.2-A.7 for further discussion of the centers of these algebras, which are not shown 
in this table. 

Figure A.1 graphs the possible Newton polygons in dimension 3; see Sects. 2.4 
and 4.3 for further discussion. Figures A.2-A.7 show bar charts for the distribution 
of normalized discriminants for the Weil polynomials of simple isogeny classes 
and the number fields generated by them; see Sect. 4.5 for the normalizing factor 
and further discussion. Figures A.8—A.10 compare the predicted isogeny Sato-Tate 
distribution from Sect. 4.6 with the observed point counts of #A(F,) for g = 2, 3,4 
and selected values of q. 


Table A.1_ The number of 


ee é Dimension | Num of groups | Num of Newton polygons 
distinct Galois groups and : 


distinct Newton polygons, by ! 2 7 
dimension 2 4 3 
3 5 3 
4 30 8 
5 9 12 
6 46 20 
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Table A.2 Counts of isogeny classes by Galois group; a(G) and Gmaxp(G) 
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Dimension G a(G) Gmaxp(G) q=2 q=3 q=4 q=5 
1 1T1 —oo 2 

1 2T1 1 1.000 5 7 7 9 

2 2T1 1 0.000 1 1 1 

2 4T1 1/2 0.370 2 2 2 

2 4T2 1/2 0.661 11 16 14 29 

2 4T3 1 0.997 8 16 30 52 

3 6T1 1/3 0.199 4 2: 6 

3 6T3 1/2 0.557 14 32 84 88 

3 6T6 1 0.588 14 34 52 94 

e) 6T11 1 0.999 48 280 676 1850 
4 4T2 1/2 —0o 6 

4 4T3 1 —0o 4 

4 8T2 1/4 0.267 13 20 14 22 

4 8T3 1/4 0.259 4 13 23 20 

4 8T4 1/4 0.095 1 2 3 

4 8T6 1/3 0.250 8 18 

4 8T9 1/2 0.519 41 112 195 405 
4 8T10 1/2 0.334 12 32 34 48 

4 8T11 1/2 0.155 2 2 6 

4 8T12 1/4 0.267 6 12 8 22 

4 8T13 1/4 0.228 2 16 8 14 

4 8T17 1/2 0.199 2 8 10 

4 8T18 1/2 0.581 20 116 254 832 
4 8T19 1/2 —0o 4 

4 8T20 1/2 0.060 2 2 2 

4 8T22 1/2 0.180 8 

4 8T23 1/3 0.155 8 4 6 

4 8T24 1/2 0.656 92 362 630 1984 
4 8T26 1/2 0.180 4 8 

4 8T27 1 0.429 24 66 144 
4 8T28 1/2 0.259 2 8 20 

4 8T29 1/2 0.448 8 42 44 178 
4 8T30 1/2 —0o 2 

4 8T31 1 0.408 12 34 112 
4 8T32 1/2 0.120 4 

4 8T35 1 0.706 34 356 1206 3546 
4 8T38 1 0.467 20 100 224 
4 8T39 1/2 0.745 62 558 1214 5554 
4 8T40 1/2 0.199 4 4 10 

4 8T44 1 0.989 368 4986 23272 93506 
5 10T1 1/5 0.114 4 

5 10T5 1/4 0.334 18 58 


(continued) 
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Table A.2 (continued) 


Dimension G a(G) Gmaxp(G) q=2 q=3 q=4 q=5 
5: 10T11 1/4 0.057 2 
5 10T14 1 0.268 8 26 
5 10T22 1/2 0.562 100 932 
5 10T23 1 0.379 14 100 
5 10T29 1 0.347 4 68 
5 10T36 1 0.356 2, 76 
5 10T39 1 0.999 6178 189514 
6 12T2 1/6 0.253 18 

6 12T3 1/6 0.061 2 

6 12T6 1/4 0.096 3 

6 12T10 1/4 0.326 41 

6 12T18 1/4 0.202 10 

6 12T21 1/2 0.000 1 

6 12723 1/4 0.267 21 

6 12T24 1/4 0.141 5 

6 12T25 1/2 0.351 55 

6 12T26 1/4 0.000 1 

6 12T28 1/3 0.061 2 

6 12T37 1/4 0.157 6 

6 12T41 1/4 0.061 2 

6 12T48 1/2 0.487 259 

6 12T76 1/4 0.243 16 

6 12T77 1/2 0.382 78 

6 12T78 1/3 0.425 128 

6 12T79 1/2 0.061 2 

6 12T90 1/2 0.309 34 

6 12T101 1/2 0.400 96 

6 12T103 1/2 0.141 5 

6 12T125 1/2 0.323 40 

6 12T134 1 0.202 10 

6 12T135 1 0.061 2 

6 12T136 1/2 0.061 2 

6 12T138 1/2 0.061 2 

6 12T139 1/2 0.560 595 

6 12T148 1/2 0.061 2 

6 12T186 1/2 0.122 4 

6 12T193 1 0.243 16 

6 12T208 1 0.304 32 

6 12T219 1/2 0.656 1784 

6 12T222 1 0.415 114 

6 12T224 1 0.061 2 


(continued) 
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Table A.2 (continued) 
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Dimension G a(G) Gmaxp(G) q=2 q=3 
6 12T226 1/2 0.122 4 

6 12T227 1 0.182 8 

6 12T236 1/2 0.441 152 

6 12T237 1/2 0.182 8 

6 12T240 1 0.061 2 

6 12T250 1 0.482 244 

6 12T255 1 0.182 8 

6 12T260 1 0.625 1240 

6 12T285 1/2 0.833 13342 

6 12T286 1 0.157 6 

6 12T287 1/2 0.182 8 

6 127293 1 0.980 71290 

Table A.3 Counts of isogeny classes by Newton polygon 

Dimension Slopes q=2 q=3 =4 q=5 


ele 


(0, 1) 


2 


4 
es | 
ees) 3 3 1 


(0, 0, 1, 1) 


66 


Sasa , 


3 G.44,% 43) 8 10 18 16 

3 (Gipezier 2 2 

4 (0,0, 0,0, 1, 1, 1, 1) 453 5062 18178 | 87115 
4 (0,0,0, 4, ney) 94 1048 6966 | 15698 
4 (0, 0,5.55594> 1,1) 43 381 1398 3108 
4 (0,4.4,4,%.% $1) 26 114 244 526 
4 (0,3.4.3.4.5. 51 6 io 66 70 

4 Gbbehhd}) 36 82 275 168 
4 (hbhhaaaa) [2 [2 (2 [a6 

4 (ded: de de ded 23) 5 6 5 5 

5 0,0,0,0.0,1,1110 [4034 [137776 

5 (0,0,0,0,3,3,1,1,1,1) |1248 | 35796 
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Table A.3 (continued) 


va) 
| 
a 
+t 
| 
ina] 
ie) 
Oe | + 
L}m 1A | 0 |e} ~t 
= /O1]D)]O }] © alr lon 
Sal AM lOlal 4 Ol AALS 
liaqgso i) Gly) So |} st | oo 
ale ianlolia tia tlm] alolta 
Sn ia) oOo) ola ASIN ITN OHM) a) OLA) 
ay ) — 
p—) —J —| a] om ES Pena es 
a a i tha Jenist feaien Joona | Tt] up | a = - 
oe.) Te aes es atin [eat feaica enna} TAP PPP 
ee) 5 ea Se 
te ot ke ae - i = wfo} ow | ep [Aim] sin 
®. - mn ThA | OIF | len | nha, sj ‘“ . . 
Neo IN Sols: Se ~f on] on] = |] Up fain {ain |oia 
i = i TIA Ist | IN | ohn x 7 . « 
pied base wales tha | IG | ica | nite TE sie | aie | otlee | tes 
« Ba | IAL | IQ | etn pale ee 
ee lialaa em Baas ames eso ee 
% ain | ain | a eal re 
Sia | len | SiN ais mI . . s “ep = sin len | SIN 
= a re SRD | SIRF IN "es be . e 
Siar | len | ain “a se x y 2 ~|S| SO] SoS [aim|in 
a O [len | ain | It | leo ue: is ae ue = Slolole 
B| S| S| So [ot | mien HAPS eat Sl S| SL SOS 
LZLS|o|olaolo ain last |i fan! S| S| SL Ol S 
n ee ee ee ee ee ae ee ee ee ee 
=| 
i} 
= 
n 
=| 
= 
5 
Alu lnln|ui|w nMinilm|njololololw 


Isogeny Classes of Abelian Varieties over Finite Fields in the LMFDB 


429 


Table A.4 Counts of isogeny classes by Newton polygon, Galois group, and angle rank (g = 1) 


Slopes Group Angle rank q= q= q=4 q= 
(0, 1) 2T1 1 2 4 4 8 
(3) 1T1 0 2 

G3) fan 0 3 3 3 1 


Table A.5 Counts of isogeny classes by Newton polygon, Galois group, and angle rank (g = 2) 


Slopes Group Angle Rank q=2 q=3 q=4 q=5 
(0, 0, 1, 1) 4T1 2 2 

(0, 0, 1, 1) 4T2 1 7 14 12 26 
(0, 0, 1, 1) 473 2 6 14 22 40 
(03.4.1) 473 2 2 2 8 12 
(44:44) 272 0 1 1 1 

iis ea = 

(5. 33> $) 4T1 0 2 2 
(444) [er [0 ee ee 


Table A.6 Counts of isogeny classes by Newton polygon, Galois group, and angle rank (g = 3) 


Slopes Group Angle Rank q=2 q=3 q=4 q=5 
(0,0,0, 1,1, 1) 6T1 I 4 4 
(0,0,0, 1,1, 1) 6T3 I 4 10 12 28 
(0,0,0, 1,1, ) 6T3 3 2 6 10 42 
(0,0,0, 1,1, 1) 6T6 3 14 32 48 82 
(0,0,0, 1,1, 1) 6T11 [3 32 218 502 1544 
(0.0. 3.4.1.1) 6T3 3 6 12 56 6 
(0,0, 3.3.1.1) 6T6 3 2 10 
(0.0. 3.4.1.1) 6T11 [3 6 42 134 268 
(0.3.4.4, 11) 6T3 3 2 4 
0, gr559 5d 6T6 3 B 2 
0,3,4,5,41 6Tll | 3 4 16 26 30 
1112.22 
(4.4.4.3.3,3) [ers 1 2 4 4 8 
Th Wa 2 De 2 
(5.5.3.3.3.3) [676 3 2 
Ld. 12) 2. 2 
(f.454:343.3) (eTl. 3 6 4 14 8 
1ii1idt1i1i1 
ee mae 9 7 7 
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Table A.7 Counts of isogeny classes by Newton polygon, Galois group, and angle rank (g = 4) 


Slopes Group Angle Rank q=2 q=3 q=4 q=5 
(0,0, 0, 0, 1, 1, 1, 1) 8T2 1 8 12 10 10 
(0,0, 0, 0, 1, 1, 1, 1) 8T2 2 3 

(0,0, 0, 0, 1, 1, 1, 1) 8T3 1 4 12 9 19 
(0,0, 0, 0, 1, 1, 1, 1) 8T4 2 1 3 
(0,0, 0,0, 1, 1, 1, 1) 8T6 4 8 8 
(0,0, 0, 0, 1, 1, 1, 1) 8T9 1 6 20 28 80 
(0,0, 0, 0, 1, 1, 1, 1) 8T9 2 19 59 100 221 
(0,0, 0, 0, 1, 1, 1, 1) 8T9 4 8 
(0,0, 0, 0, 1, 1, 1, 1) 8T1o | 2 10 26 28 32 
(0,0, 0, 0, 1, 1, 1, 1) 8T1o «| 4 4 2 

(0,0, 0, 0, 1, 1, 1, 1) 8Tl1 4 2 2 4 
(0,0, 0, 0, 1, 1, 1, 1) 8T12. |4 6 12 8 22 
(0,0, 0, 0, 1, 1, 1, 1) 8T13. 3 2 8 4 12 
(0,0, 0, 0, 1, 1, 1, 1) 8T13. 4 6 2 
(0,0, 0, 0, 1, 1, 1, 1) 8T17 | 4 2 6 10 
(0,0, 0, 0, 1, 1, 1, 1) 8T1g =| 2 12 84 206 688 
(0,0, 0, 0, 1, 1, 1, 1) stig = |4 2 2 6 12 
(0,0, 0, 0, 1, 1, 1, 1) sT19 4 4 

(0,0, 0, 0, 1, 1, 1, 1) 8T20 |4 2 2 2 
(0,0, 0, 0, 1, 1, 1, 1) gT22 |4 4 
(0,0, 0, 0, 1, 1, 1, 1) 8T23. 4 6 2 4 
(0,0, 0, 0, 1, 1, 1, 1) 8T24 3 50 202 406 1292 
(0,0, 0, 0, 1, 1, 1, 1) 8T24 | 4 12 54 62 356 
(0,0, 0, 0, 1, 1, 1, 1) 8T26 «| 4 4 4 
(0,0, 0, 0, 1, 1, 1, 1) 8T27. «(| 4 24 60 136 
(0,0, 0, 0, 1, 1, 1, 1) sT28 4 4 
(0,0, 0,0, 1, 1, 1, 1) 8T29 | 4 8 34 40 140 
(0,0, 0, 0, 1, 1, 1, 1) 8T30 |4 2 

(0,0, 0, 0, 1, 1, 1, 1) 8T31  |4 8 28 110 
(0,0, 0,0, 1, 1, 1, 1) 8732 |4 2 
(0,0, 0, 0, 1, 1, 1, 1) 8T35 «| 4 26 298 926 3122 
(0,0, 0, 0, 1, 1, 1, 1) 8T38 4 10 48 172 
(0,0, 0, 0, 1, 1, 1, 1) 8T39 4 46 488 1068 4964 
(0,0, 0, 0, 1, 1, 1, 1) 8T40 |4 4 4 10 
(0,0, 0, 0, 1, 1, 1, 1) eT44. | 4 240 3674 15112 | 75662 
0005.5 lolt 8T6 4 4 
0,0,0,4,4,1,1,1 4 2 
0,0,0,4,4,1,1,1 4 4 
(0,0,0,3.3,L11) [er31 |4 2 


(continued) 
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Table A.7 (continued) 


Slopes Group Angle Rank q=3 q=4 q=5 
ee eee aT3s 4 32 108 260 
(0,0,0,3.5,L11) |8t38 4 10 42 48 
(0,0,0,3.5,L11) | st4a 4 1006 = 6814-15378 
(0,0.3.3.5.5,L1) (8t3 2 4 
(0,0,3.3.5.5.L1) (sta 2 1 

(025 eld 8T6 4 4 
0,0,5,4,9,4,11 8T9 p 17 29 62 
(0,0,3.3.4.4,11) [sts [3 4 14 ry 
(0,0,3.4,4,5,11) [erro [2 2 2 
(0,0.3.3.5.5,L1) (sti1 4 2 
(0,0.3.3.3.5.L1) (8T13 3 2 
(0,0.3.3.5.5,L1) |sti7 /4 2 
(0,0.3.3.5.5,L1) jets 2 22 30 108 
(0,0.3.3.5.5.L1) (stig 4 4 
(0,0,3.3.5.5,L1) (822 /4 4 
(0,0,3,334,11) |st24 [3 64 96 252 
(0,0,3,3,3.311) [st24 |4 2 30 
(0,0,3,4,4,5,11) [eta7 [4 4 
(0,0,3,4,4.5,11) [er2e [4 6 16 
(0,0.3.3.5.5,L1) | st29 4 6 4 38 
(0,0.3.3.3.5,L1) (831 /4 4 6 
(0,0.3.3.5.5,L1) | 8T32 4 2 
(0,0.3.3.5.5.L1) (8735/4 22 144 | 150 
(0,0.3.3.5.5,L1) (8384 10 4 
(0,0,3.3.5.5.L1) (8394 38 68 432 
(0,0,3.3.5.5,L1) (sta /4 200 976 1978 
0,5:4>4>4>9> 31) |8T13 |3 2 

OLELEE EI) [ors [4 > 
(0,4.4.4:3,3,3.1) [setae [3 10 18 36 48 
(0.4.4.4.5.5.$1) (8t24 (4 2 2 6 3 
(0.4.4.4.5.5.%1) (8739/4 4 26 40 130 
(04.44.5551) (tae 4 10 64 162 346 
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Table A.7 (continued) 
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Table A.8 Counts of isogeny classes by Newton polygon, Galois group, and angle rank (g = 5) 


Slopes Group Angle Rank q=2 q=3 
(0,0, 0, 0,0, 1, 1, 1,1, 1) 10T1 4 
(0,0, 0, 0,0, 1,1, 1,1, 1) 10T5 10 38 

(0, 0,0, 0,0, 1,1, 1,1, 1) 10TS5 2 

(0, 0,0, 0,0, 1,1, 1,1, 1) 10T14 8 26 

(0, 0,0, 0,0, 1, 1,1, 1, 1) 10T22 34 292 
(0,0, 0, 0,0, 1,1, 1,1, 1) 10T23 12 80 
(0,0, 0, 0,0, 1, 1,1, 1, 1) 10T29 ] 50 
(0, 0,0, 0,0, 1,1, 1,1, 1) 10T36 2 52 
(0, 0,0, 0,0, 1, 1,1, 1, 1) 10T39 3966 137232 
(0, 0,0,0, 5,551, 1,1, 1) 10T22 46 480 
(0,0,0,0, 3.3.1, 1,11) 10723 8 
0,0,0,0, 5,5, 1,1, 1,1 10T29 vs 18 


0,0,0,0,5,5, 1,111 


1 

1 

5 

5 

5 

5 

5 

5 

5 

4 

5 

5 

10T36 5 12 

(0, 0,0,0,4,4,1,1,1,1 10T39 5 1200 35278 
(0,0,0, 3.3.4.4, L 11) 10T22 = (|5 4 52 
(0,0,0, 3.5.5.5. 1 1.1) 10T23|5 2 12 
(00,0, 3.3.3.4. L 11) 10T36 5 2 
(0,0,0, 3.5.5.5. 1 1.1) 10739 |5 526 11492 
(0.0, 4.4.4 $.5, 5,1 .1) 10T11 5 2 
(0.0,4.4.4.3.5.% 1) 10722 5 4 32 
(0.04.4455%b1) 10736 5 10 
0,0, 3.49) 9 9g bl 10739 5 166 2980 
0,0,4,4,9,4.9.4, 11 10T22 4 2 48 
(0,0, 3.4.4.4, 4.3511) 10739 5 60 648 
(0,4.4.4.4.3.3,2,3.1) 10T22. «| 5 4 4 
(44434441) 10739 5 82 964 
(4.445.528 91) 10T39 5 12 166 
(0.5.4. 3:4:9: 5.5551 10T39 5 8 56 
(5.4.$.5.5.$.9.4.9,$) [rors 1 6 14 
(foe h44.4 4) 107T39—s«| 5 134 600 
G.h443993.249)  [rora2 [4 6 24 
G.i449993.3.4)  [20rs9 [5 16 70 
Bo 3050 D0 2 3) 9 903 10739 [5 2 20 
$+ 8 5+ $+ 8 5+ 5+ 8 59 5 1075 1 2 4 
(3.3,3,3,3,3,3.3.3.3) [10739 [5 6 8 
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Table A.9 Counts of isogeny classes by Newton polygon, Galois group, and angle rank (g = 6) 


Slopes Group Angle Rank q=2 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T2 1 10 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T3 1 1 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T10 1 9 
(0, 0, 0, 0,0, 0,1, 1,1, 1, 1, 1) 12T10 3 3 
(0, 0, 0,0, 0,0, 1,1, 1, 1, 1, 1) 12T18 1 6 
(0, 0, 0,0, 0,0, 1,1, 1, 1, 1, 1) 12T21 3 1 
(0, 0, 0,0, 0,0, 1,1, 1, 1,1, 1) 12T23 3 3 
(0, 0, 0,0, 0,0, 1, 1,1, 1, 1, 1) 12T23 5 6 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12724 3 3 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T25 3 55 
(0, 0, 0,0, 0,0, 1,1, 1, 1, 1, 1) 12T48 3 95 
(0, 0, 0,0, 0,0, 1, 1,1, 1, 1, 1) 12T48 ) 12 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T48 6 2 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T76 6 16 
(0, 0, 0, 0,0, 0,1, 1, 1, 1, 1, 1) 12T77 5 40 
(0, 0, 0, 0,0, 0,1, 1,1, 1, 1, 1) 12T78 2 74 
(0, 0, 0,0,0,0, 1, 1,1, 1,1, 1) 12T78 6 2 
(0, 0, 0, 0,0, 0,1, 1, 1, 1, 1, 1) 12T90 3 32 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T101 3 67 
(0, 0, 0, 0,0, 0,1, 1,1, 1, 1, 1) 12T103 3 4 
(0, 0, 0,0, 0,0, 1, 1,1, 1, 1, 1) 12T125 6 22 
(0, 0, 0, 0, 0,0, 1, 1,1, 1, 1, 1) 12T134 6 10 
(0, 0, 0,0, 0,0, 1, 1,1, 1, 1, 1) 12T135 6 2 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T136 6 2 
(0, 0, 0, 0, 0,0, 1, 1, 1, 1, 1, 1) 12T138 6 2 
(0, 0, 0,0, 0,0, 1, 1,1, 1,1, 1) 12T139 3 365 
(0, 0, 0,0, 0,0, 1,1, 1, 1, 1, 1) 12T139 6 8 
(0, 0, 0, 0,0,0, 1,1, 1, 1, 1, 1) 12T148 6 2 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 12T186 6 4 
(0, 0, 0, 0,0, 0,1, 1,1, 1, 1, 1) 12T193 6 12 
(0, 0, 0, 0, 0,0, 1, 1, 1, 1, 1, 1) 12T208 6 30 
(0, 0, 0,0, 0,0, 1,1, 1, 1, 1, 1) 12T219 5 1084 
(0, 0, 0, 0,0, 0,1, 1,1, 1,1, 1) 12T219 6 102 
(0, 0, 0,0, 0,0, 1, 1,1, 1,1, 1) 12T222 6 106 
(0, 0, 0,0, 0,0, 1, 1, 1, 1, 1, 1) 127224 6 2 
(0, 0, 0,0, 0,0, 1, 1, 1, 1,1, 1) 12T226 6 2 
(0, 0, 0,0, 0,0, 1,1, 1, 1, 1, 1) 12T227 6 2 
(0, 0, 0,0, 0,0, 1,1, 1, 1,1, 1) 12T236 6 98 
(0, 0, 0,0, 0,0, 1, 1,1, 1, 1, 1) 127237 6 4 
(0, 0, 0,0, 0,0, 1, 1,1, 1, 1, 1) 12T240 6 2 
(0, 0, 0,0, 0,0, 1,1, 1, 1, 1, 1) 12T250 6 152 
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Slopes Group Angle Rank q=2 
(0, 0, 0,0, 0,0, 1, 1, 1,1, 1,1) 127255 2 
(0,0, 0,0,0,0, 1, 1,1,1,1,1) 127260 968 
(0,0, 0,0, 0,0, 1, 1,1, 1,1, 1) 127285 8810 
(0,0, 0,0,0,0, 1, 1,1, 1, 1,1) 127286 2 
(0,0, 0,0,0,0, 1, 1,1,1,1,1) 127287 6 
(0,0, 0,0, 0,0, 1, 1,1, 1,1, 1) 127293 42488 
(0,0,0,0,0, 3.3.1, 1, 1,1, 1) 127125 2 
(0,0,0,0,0, 3. $.1, 1, 1,1, 1) 127250 28 
(0,0,0,0,0, 5, 5.1, 1,1, 1,1) 127255 6 
(0,0,0,0,0, $3.1, 1, 1,1, 1) 127260 34 
(0,0,0,0,0, 3.3.1.1, 1,1, 1) 127286 4 
(0,0,0,0,0,3,5,L1111 127293 17392 
oe ee ere ma 1276 2 


O00 Or tt 12710 15 
(0,0,0,0, 3,4, 4.4.1.1, 11) 12723 6 
(0,0,0,0, 3,4, 4.4.1.1, 1,1) 12723 1 
(0,0,0,0, 3.3.4.5. 11, 11) 12728 2 
(0,0,0,0, 3.3.4.5. 11,11) 12748 42 
(0,0,0,0, 3.3.4.5. 1, 11) 12748 36 
(0,0,0,0, 3.3.4.5. 1 1, 1,1) 12777 34 
(0,0,0,0, 3.3.3.5. 1 1,11) 12779 2 
(0,0,0,0,3.3.4.5,L11,1) 12790 2 
(0,0,0,0,$.5.3.5.111,1) 127101 19 
OOOO Da aaa 127125 4 


DIRE TR | PR | IR | BIR | TR | IR | IR | SIRE | IR PR | DoT | TR | IR | IR | BTR | TR | IR | BIR | BIR | IR | Be 


DIE NDI NID DI DID] DAD] BLD] BW) WwW] BH] WL DN] HL WL NIN] WIE DIDI DIL DADLAD!L ALDI AADAYS 


0:0; 0,0,4,.25 Bt htt 127125 2 

(0,0,0,0, 3,4, 4.4.1.1, 1,1) 127139 93 
(0,0,0,0, 3.3.3.5. 1 111) 127193 2 

(0,0,0,0, 3.3.4.5. 11, 11) 127219 14 
(0,0,0,0, 3.3.4.5. 1111) 127236 40 
(0,0,0,0, 3.3.4.5. 1, 1,1) 127237 4 

(0,0,0,0, 3.3.3.5. 1 11.1) 127250 34 
(0,0,0,0,$.3.3.5,L11,1) 127260 152 
(0,0, 0,0, eee) 127285 2020 
(0,0,0,0, 3.3.3.3 11,1,1) 127293 5704 
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Table A.9 (continued) 
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Table A.9 (continued) 
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Table A.10 Brauer invariants of endomorphism algebras. Each “zeroes” row combines all 
commutative endomorphism algebras for that g 


g End°(A /F,) invs End°(A /F,) invs | Count qs Example 

1 | zeroes zeroes 6000 all 1.2.ab 
(0) (1/2) 154 all except 169 1.2.ac 
(1/2) (1/2) 30 squares 1.4.e 

2 | zeroes zeroes 1198020 | all 2.2.b_a 
(0) (1/2) 195 all except 121 2.2.a_ae 
(0, 0) (1/2) 80 5,7, 9,13, 17,25... | 2.5.a_a 
(1/2, 1/2) (1/2) 7 25, 49, 169, 625 2.25.a_by 

3 | zeroes zeroes 962658 | all S.2.b bod 
(0) (1/2) 16 3,4, 7,9, 16, 25 3.3.a_a_aj 
(0, 0) (1/3, 2/3) 140 all 3.2.a_a_ac 
(1/3, 2/3) (1/3, 2/3) 2 8 3.8.g_bk_ea 

4 | zeroes zeroes 141018 | 2,3,4,5 4.2.ac_b_ab_d 
(0) (1/2) 14 2,3,4,5 4.2.a_c_ae 
(0, 0) (1/2) 7 3;4,5 4.3.a_a_a_aj 
(0, 0) (1/4, 3/4) 53 2,3,4,5 4.2.a_a_a_c 
(0, 0) (1/2, 1/2) 84 2,3,4,5 4.2.a_ae_a_k 
(1/2, 1/2) (1/4, 3/4) 6 4 4.4.a_m_a_cq 
(1/2, 1/2) (1/2, 1/2) 4 4 4.4.ae_q_abo_dw 
(0, 0, 0, 0) (0,0,1/2,1/2) | 23 3,4,5 4.3.a_ac_a_ad 
(0,0,0,0,0,0) | (0,0,1/2,1/2) | 21 3,5 4.3.a_c_a_ad 

5 | zeroes zeroes 197078 | 2,3 5.2.ad_h_ao_y_abm 
(0, 0) (1/5, 4/5) 20 2,3 5.2.a_a_a_a_ag 
(0, 0) (2/5, 3/5) 6 2,3 5.2.a_a_a_a_e 

6 | zeroes zeroes 89471 2, 6.2.b_ab_a_ad_ad_h 
(0) (1/2) 4 2 62aaeaai 
(0, 0) (1/2) 4 2 6.2.ac_c_a_ae_i_ai 
(0, 0) (1/6, 5/6) 12 2 6.2.a_a_ac_aac 
(0, 0) (1/3, 2/3) 31 2 6.2.a_a_ac_a_a_o 
(0, 0) (1/2, 1/2) 88 2 6.2.a_a_a_c_a_ac 
(0, 0, 0) (0, 1/2, 1/2) 13 2 6.2.a_ac_a_g_a_ai 
(0, 0, 0, 0) (0, 1/3, 2/3) 2 2 6.2.a_a_ac_aaa 
(0, 0, 0, 0) (0, 1/2, 1/2) 1 2 6.2.a_ac_a_c_a_a 
(0, 0, 0, 0) (0,0, 1/2,1/2) | 42 2 6.2.a_ab_a_c_a_ag 
(0,0,0,0,0,0) | (0,0, 1/3,2/3) | 20 2 6.2.a_a_d_aac 
(0,0,0,0,0,0) | (0,0,1/2,1/2) | 14 2 6.2.a_b_a_a_a_ac 
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Fig. A.3 Normalized root discriminants for g = 2 
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Fig. A.6 Normalized root discriminants for g = 5 
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Fig. A.7 Normalized root discriminants for g = 6 
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Fig. A.8 Isogeny Sato-Tate distribution for g = 2 with g = 7, 27, 101, 211 
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Computing Rational Points on Rank 0 ®) 
Genus 3 Hyperelliptic Curves sei 


Maria Inés de Frutos-Fernandez and Sachi Hashimoto 


Abstract We compute rational points on genus 3 odd degree hyperelliptic curves 
C over Q that have Jacobians of Mordell—Weil rank 0. The computation applies 
the Chabauty—Coleman method to find the zero set of a certain system of p-adic 
integrals, which is known to be finite and include the set of rational points C(Q). 
We implemented an algorithm in Sage to carry out the Chabauty—Coleman method 
on a database of 5870 curves. 


1 Introduction 


Given a curve C of genus g > 2 defined over Q, Mordell’s conjecture, proved by 
Faltings [10] implies that the set of rational points C(Q) is finite. Our goal is to 
explicitly compute this finite set of points in the case of a genus 3 hyperelliptic 
curve with Jacobian having Mordell—Weil rank 0. For a general curve, computing 
the set of its rational points is an unsolved problem. However, there are techniques 
available when the Mordell—Weil rank r of the Jacobian is smaller than the genus. In 
this case it is often possible, for a specific curve, to provably find its rational points. 

The rational point-finding techniques applied here for curves of small Mordell— 
Weil rank date back to an idea of Chabauty [6], who proved in 1941 that when 
the rank r is less than the genus g, fixing a prime of good reduction p, the set 
C(Qp) N J(Q) is finite, and hence C(Q) is, where J(Q) denotes the closure in the 
p-adic topology of J (Q). In the 1980s, Coleman made Chabauty’s idea effective: 
he gave an upper bound on the number of rational points [7], now known as the 
Chabauty—Coleman bound. Coleman’s upper bound comes from counting the zeros 
of p-adic (Coleman) integrals on J(Q,) that vanish on J(Q). This bound was 
refined by Stoll [13]. We exhibit some curves that reach the Stoll bound. In 2010, 
Balakrishnan, Bradshaw, and Kedlaya [3] gave a practical algorithm to compute 
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these p-adic Coleman integrals on odd degree hyperelliptic curves and implemented 
the algorithm in Sage [16]. 

Recently, Sutherland [15] constructed a database of genus 3 hyperelliptic curves 
of discriminant up to 107 using the approach of [4]. Armed with a decent upper 
bound on the number of rational points on a curve, a practical algorithm for 
computing these Coleman integrals that vanish on the rational points, and a large 
number of curves to experiment on, one might ask: how effective is Coleman’s 
method for determining rational points? 

For example, Balakrishnan, Bianchi, Cantoral-Farfan, Ciperiani, and Etropolski 
[2] carried out computations to find rational points on 16,977 genus 3 odd degree 
hyperelliptic curves with rank 1 Jacobian. They found that in most cases, the 
Chabauty—Coleman method picks up exactly the rational points, or the rational 
points and the Weierstrass points that are defined over Q,. In the remaining cases, 
they were able to explain the existence of the extra points either by torsion, linearity 
of the integral, or by extra endomorphisms of the Jacobian. 

In order to satisfy the Chabauty—Coleman hypothesis that r < g we need r = 
0, 1, or 2, but in the case of r = g — 1, one will likely find many extra p-adic but 
non-rational points when applying the Chabauty—Coleman method, and in general 
the Chabauty—Coleman set will be larger than the set of rational points. More work 
would be needed to determine the set of rational points; for example, one could 
implement a Mordell—Weil sieve. For these reasons, we focused on the case of g = 
3 and r = O. In this case, we computed the rational points in Magma [5], up to 
height 10° and verified these points using Chabauty—Coleman calculations. Most of 
the time, in 3070 curves, the only extra points in the Chabauty—Coleman set were 
Weierstrass points that were defined over Q, \ Q. In 17 cases, we picked up points 
on the curve giving rise to torsion points of higher order on the Jacobian. These 
points are defined over quadratic fields where the prime p splits. 

In Sect.2, we present background on Coleman integration: we define the 
Coleman integral and list its properties, and we also discuss explicit Coleman 
integration. In Sect.3, we discuss our implementation of the Chabauty—Coleman 
method for computing rational points on hyperelliptic curves of genus 3 with rank 
0 Jacobian. Finally, in Sect.4, we present our results and give an overview of the 
rational points and extra Q,-points found on the database of curves. We discuss 
several interesting examples in detail, including a curve with an 18-torsion point on 
its Jacobian and a curve that is sharp for Stoll’s bound. 


2 Background on Coleman Integration 


In this section we will define the basics of Coleman integration and necessary 
background for the rest of the paper. For a more thorough introduction to the rigid 
analytic geometry and theory behind Coleman integrals, we refer the reader to [8], 
whose exposition we have loosely followed herein. Other good background on the 
Chabauty—Coleman method can be found in [11] and [17]. 
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Fix K a field that is complete with respect to a non-archimedean absolute value 
and fix some embedding of K into C,. Let R be the ring of integers of K. 

Let C/R be a smooth connected affinoid curve over K and set Cx = C xr K. 
mee F be the residue field of K, and C K be C Xp F. There is a reduction map 

: Ce > C kK. We say that C has good reduction over K if C K 1s smooth. 
7 ait the preimage of a point under the reduction map red~!(x) for x € Cx the 
residue disc of x. These residue discs partition Cx (K). 

Let C have good reduction, w be a holomorphic one-form on C, and P,Q € 
C(K) be K-rational points. Coleman defined an integral : w in the following way. 
Fix a Frobenius lift @ : Cx — Cx, that is, a morphism of rigid analytic varieties 
which reduces to (relative) Frobenius on C K. 


Theorem 1 ([8, Theorem 2.1]) Suppose there is a polynomial P(T) ¢€ C,[T] 
which does not vanish on any root of unity, and that 


P(o*)w is exact. 


Then there is a function fy on C(Cp) which is analytic on each residue disc such 
that df = w and P(¢*)( fw) is analytic. The function fy is unique up to a constant 
and is independent of $ and P. 


In this way, Coleman provides a way to compute a locally analytic antiderivative 
of w. Coleman proves, in several corollaries, properties of the antiderivative, which 
we collect here in one theorem: 


Theorem 2 ((8, Theorem 2.3, Proposition 2.4]) The integral satisfies the follow- 
ing properties: 


1. Linearity: [Pot Bo’) = a feat Be a! 
2. Additivity: [po = [Pot [5 o 
3. Change of variables: if C’ is a curve and ¢ : C — C' a rigid analytic map 


7 i ee w. For example, ¢ can be taken to be 


between wide opens then {? go= 
a lift of the pth power Frobenius. 
4. Fundamental theorem of calculus: i df = f(Q) — f(P). 


5. For any divisor D := )°; Qj — P; of degree zero on C, the integral Is o= 

»; ied w is well-defined, and ts @ = 0 when D is principal. 

oO 
6. Galois equivariance: if o is an automorphism of C p, then ie w) = Bs wo” 

Then Coleman shows that we can use this integration theory to obtain arithmetic 
information about points. 

Suppose now that Cx is the base change of a curve C defined over Q and that 
the Jacobian J of C/Q has Mordell—Weil rank r < g. Fixit: C > J to be an 
embedding given by choosing a Q-rational basepoint b € C(Q) and mapping P 
to [P — b]. Denote by Jx the base change of J to K. The embedding ¢ induces 
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an isomorphism of vector spaces .* : H (Jn, 2!) + H(Cr, Q!) which is 
independent of the chosen basepoint b (Proposition 2.2 of [12]). 
We have functionals 


he(D) =f w, forw € H°(Jx, Q'). 
D 


If D= > 12: — P;], then A, can also be thought of as functionals on Cx by 
gtd) = 
Note that H°(Jx, Q!) is g-dimensional but the Jacobian J(Q) has rank r < g. 


Fix a basis wo, ..., @g—1 for the space of holomorphic differentials on Jx. Define 
functionals 


(dD) = | a: J(Q—- K. 
D 


Consider all linear relations between the restrictions of the 4; to J(Q): there are at 
least g — r independent linear relations and they are of the form 


g-l 
Dea f @; = 0, witha; € K. 
5 D 


Let S be the set of functionals w(D) := ee Oj 15 a; from Jx to K such 
that w(D) = 0 for all D € J(Q). Via the map v, we can define w(z) := 
ys Qj he “aw; for any z € C(K). Note that in particular 4(z) = O for every 
zinC(Q). 

By Theorem |, the functionals jz in S are locally analytic, and we can use bounds 
on the numbers of zeros of p-adic power series to get control over the number 
of zeros for jz on each residue disc. In particular, there are at most finitely many 
common zeros of any non-zero linear combination of functionals. Hence the set of 
rational points C(Q) is contained in the finite set of points satisfying (z) = O for 
every win S. 

Coleman’s proof suggests an effective method for computing C(Q): first, 
compute the linear relations that the functionals satisfy. Then, compute the finite 
zero set of the functionals on each residue disc. This second step requires an 
algorithm for computing the Coleman integral to some p-adic precision. 

Balakrishnan, Bradshaw, and Kedlaya give a practical algorithm for computing 
FF 4 w, on an odd-degree hyperelliptic curve C, where P and Q are in C(Q,,) and 
is a one-form of the second kind. We give a brief outline here, and for more details 
refer the reader to [3]. 

To evaluate Coleman integrals, we work locally in each residue disc, where the 
Coleman integral can be expressed as a convergent power series in a parameter, 
called a local coordinate. If K(C) is the function field of C, then our local 
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coordinate is simply a uniformizing parameter t for K(C)p, that is, an element 
t € K(C)p such that the valuation of t is one. 

Our residue discs come in two flavors: those that are the residue disc of a 
Weierstrass point, the Weierstrass residue discs, and those that are not, the non- 
Weierstrass residue discs. The infinity disc which contains the point at infinity is a 
Weierstrass residue disc in the odd degree hyperelliptic case. 

For a hyperelliptic curve C/K we have differentials w; = (x!/2y) dx, fori = 
0,...,2g— 1 such that their classes [w;] generate His (C). For any one-form w, we 


can write w = pu ajw; +df for some a; € K and exact differential df, and then 
use linearity to break up an integral. 

Assume that P and Q are in the same residue disc, and that this disc does not 
contain a pole of w. To compute a Coleman integral between P and Q, called a tiny 
integral, we simply write P, Q, and w in a local coordinate t, P(t), Q(t), w(t), and 
integrate formally: 


2-1 


aS 20) x(t) dx(t) 
Ye aj 4 dt+ f(Q)- f(P). 
i=0 


Q Q Q 
o= dj wit [ df = —— 
[ dX if ‘Ip = a) 2y(t) dt 


To compute a Coleman integral between P and Q in different non-Weierstrass 
residue discs we fix Teichmiiller points P’ in the residue disc of P and Q’ in the 
residue disc of Q such that #(P’) = P’ and ¢(Q’) = Q’. By the previous algorithm 


we can compute tiny integrals [ a ‘@ and I 2" «) between our desired points and 
the Teichmiiller points. Applying the change of variables property of the integral, 


i id b*wo= i w. Thus we compute @*w; for each basis vector w;. Then ¢* acts 


linearly on cohomology, so we can represent @* as a matrix M. However, $*a@j is 
only cohomologous to a linear combination of @;, so to represent the action of ¢* 
as a matrix, we write 
2g-1 
* 
Pa; = > Mjjo; + dfi. 


j=0 


This gives us a linear system, and we can solve 
2g-1 
(M—-1)| fF o; | =| Disp) AP) — fi’) 


for the value of i w;. By linearity, we can compute 2 @. The value of the 
original integral is simply the sum a ‘ot . Ef o+ f a w. Finally, for the case 
of P or Q in a Weierstrass disc, we use the fact that if P is a Weierstrass point 
f i wo=1/2f a @ where 1 is the hyperelliptic involution (Lemma 16 of [3]). 
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3 Algorithm 


In this section we describe our implementation of the Chabauty—Coleman method 
for our case of interest. For a more general description of this method, see for 
instance [11]. Our code is available at [9]. 

We require C to be a hyperelliptic curve of genus 3 given by an odd degree model 


C:y*? = F(x), 


where F(x) € Q[x] is a monic polynomial of degree 7. Every such model has a 
unique rational point at infinity, that we denote oo and take as our basepoint for the 
Abel—Jacobi embedding « : C <> J. Moreover, we assume that the Jacobian J of 
C has Mordell—Weil rank 0 over Q. Let p > 7 be the smallest prime such that C 
has good reduction mod p (note that p does not divide the leading coefficient of F 
since we assume that F is monic). 

Our goal is to provably compute the set of rational points of C. We first use 
Magma to compute C(Q)known to be the set of all rational points in C of naive 
height bounded by 10°. Having this set reduces the number of Coleman integrations 
required by the algorithm. 

The input of our algorithm consists of the hyperelliptic curve C, the prime p and 
the set C(Q)known- The output is a finite subset of C(Q,) containing C(Q), which 
is returned as three separate subsets: 


¢ The set C(Q) of rational points of C. 

° The set of points Q in C(Q,) \ C(Q) such that [Q — oo] € J(Q,) is 2-torsion. 

° The set of points Q in C(Q,) \ C(Q) such that [Q — co] € J(Q,) is an n-torsion 
point for some n > 2. 


Note that since we are assuming that J has rank 0, no point in C can give rise to 
a point of infinite order in J. Moreover, we remark that under this rank hypothesis 
we automatically know that the annihilator of J(Q) under the integration pairing is 
spanned by w; = x! /2y dx fori = 0, 1, 2, which simplifies Step 2 of our algorithm. 

We use essentially the same precision analysis as in [2]. Proposition 3.11 in [2] 
can be easily adapted to show that it is enough to set N = 2p + 4 to be the p- 
adic precision and M = 2p + | to be the t-adic precision. We give a basic outline 
of the adaptations needed here (the reader should replace the statements in their 
paper about the annihilating differentials a, 6 with the annihilating differentials 
@0, W1, 2). 

First note that assuming [2, Proposition 3.11], N = 2p+4and M = 2p+4 1 
is sufficient for p prime p > 2g = 6 (this follows from Riemann-Roch, see [2, 
Remark 3.12]). The proof of this proposition depends on two earlier statements: [2, 
Lemmas 3.10 and 3.9], which do not depend on the rank of the curve or the exact 
set of annihilating differentials. 
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While the lemmas and propositions in [2, Section 3.1] that bound the number of 
zeros in each residue disc of the curve remain true with minimal modification to the 
proofs, we did not use these bounds to rule out residue discs in our implementation. 

An outline of our algorithm is given in Algorithm 3. Below we explain its main 
steps. 


1: function CHABAUTY—COLEMAN(C, p, C(Q)known) 

2 Set the p-adic precision N to 2p + 4. 

3 Set the t-adic precision M to 2p + 1. 

4 Initialize found-points := empty list. 

5: for each P ¢ C(F p) up to the hyperelliptic involution ¢ do 

6 Compute fo, fi and f in local coordinates. 

7 for each point OQ € C(Q,) corresponding to a common zero of the f; do 
8 Add Q and 1(Q) to found-points. 

9 


end for 
10: end for 
11: Classify found-points into three lists: Q-points, non-rational 2-torsion- 


points, and non-rational higher-torsion-points. 

12: return Q-points, non-rational 2-torsion-points, and non-rational higher-torsion- 
points. 

13: end function 

Algorithm 3 Chabauty—Coleman method for a genus 3 hyperelliptic curve with rank 0 Jacobian 


Step I (Required Precision) We need to choose the p-adic precision N and the t- 
adic precision M to guarantee that, in Step 3, we will obtain all the roots of fj (pr) 
in Zp.SetN =2p+4andM =2p+1. 


Step 2 (Annihilator) A basis of the space of differentials H °(Cg,» Q!) is given by 
{w@o, @1, w2}, where w; = (x! /2y) dx. For each i = 0, 1, 2, define 


< 
fi) = Wj, 


where oo denotes the point at infinity. The functions f(z) are zero on all rational 
points of C, but not identically zero. 


Step 3 (Searching in Residue Discs) For each point P in C(F p)» we compute the 
set of Q,-rational points P reducing to P such that f;(P) = 0 fori = 0, 1,2. To 
perform this computation, we consider two different cases: 


(i) If there is a point P € C(Q)known reducing to P, then we expand each @; in 
terms of a uniformizer ¢ at P and we formally integrate to obtain three power 
series f;(t), that parametrize the integrals of the w; between P and any other 
point in the residue disc. 
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(ii) Otherwise, we start by finding a Q,-point P reducing to P = (%, Yo) (note 
that P cannot be oo in this case). If yo = 0 we can take P = (xo, 0) where xg 
is the Hensel lift of xo to a root of F(x). Otherwise, we can take P = (Xo, yo) 
where xo is any lift of x9 to Zp and yo is obtained from Yo by applying Hensel’s 
Lemma to y? = F(x). Then we set f(t) = fit) + fie @;, Where each fi(t) 
parametrizes the integral of @; between P and any other point in the residue 
disc. 


To provably compute the set of common zeros of fo, f; and f2 to a desired 
precision, we need to impose the condition that at least one of the f; has only simple 
roots. We check this requirement by computing the discriminant of the truncations 
of the power series to t-adic precision M. If all the f; have multiple roots, we run 
the algorithm using a bigger prime p for which they do not. 

Say fi, has no double roots. Then we use the PARI/GP function polrootspadic to 
compute its roots, truncating it first to t-adic precision M = 2p + | and computing 
the coefficients up to p-adic precision N = 2p + 4. For each root that lies in pZp, 
we check whether it is also a root of the other two fj; if so, it corresponds to a point 
P € C(Q,) lying over P. 


Step 4 (Identifying the Rational Points) Now, for each of the points Q found in 
Step 3, we attempt to reconstruct Q as a Q-rational point, using Sage. If this is not 
possible, then [Q — oo] must be a torsion point in J (Q,), because J has rank 0. If Q 
is a Weierstrass point, it will give rise to a 2-torsion point in the Jacobian; otherwise 
we classify it as a higher order torsion point. 


Step 5 (Identifying Higher Order Torsion Points) For each of the points Q that 
is not identified as a rational or Weierstrass point in Step 4, we use Sage’s 
algebraic_dependency function to attempt to find the minimal polynomial of the x- 
coordinate and reconstruct the point as a point with coordinates defined over an 
extension of Q. We then use Magma to compute the torsion order of [Q — oo]. 


4 Analysis and Examples 


We ran the Sage implementation of our algorithm on a database of 5870 hyperel- 
liptic curves of genus 3 with Jacobian of rank 0 taken from the list [1]. This list [1] 
is a subset of Sutherland’s database of genus 3 hyperelliptic curves of discriminant 
bounded by 107 [15]. It was created by first filtering for curves with odd-degree 
models and then filtering for curves that are provably rank 0. The rank computation 
was done with the Magma function RankBounds, which returns an upper and 
lower bound for the rank of the curve through descent computations. For curves in 
[1], the upper bound and the lower bound returned by RankBounds agreed and 
equaled 0. We computed a set of monic models for each curve in [1] for the purpose 
of our computations. 


Computing Rational Points on Rank 0 Genus 3 Hyperelliptic Curves 457 


2100 
2000 
1900 
1800 
1700 
1600 
1500 
1400 
1300 
1200 
1100 
1000 
900 
800 
700 
600 
500 
400 
300 
200 
100 


Number of curves with n rational points 


Fig. 1 Number of curves in the database with n rational points 


In 23 of these curves, there was at least one Q,-point for which the three power 
series fo(z), f1(z) and f2(z) had double roots at the first prime p > 7 of good 
reduction. To analyze these curves, we replaced p by a bigger prime for which at 
least one of the power series had only simple roots. 

Our implementation proves that, for each of the studied curves, the entire set 
of rational points equals the set of rational points of naive height less than 10°. 
In fact, we proved that, with respect to the odd-degree monic models used in the 
computation, the maximum among the global heights of rational points in those 
models is 30.7611440827071, reached only by points at three curves. Or, if we 
translate this to the odd degree models in [1], those three curves have points of 
global height 4.394449 15467244, while the heights of the rational points in the rest 
of the curves is bounded above by 3.! 

Next, we show in Fig. | how many of the curves in our database have a certain 
number of rational points. We observe that the maximum number of points is six, 
and that a vast majority of the curves have three or fewer rational points. 

As mentioned on the introduction, the Chabauty—Coleman sets of these curves 
agreed with the set of rational points on 2783 cases, contained only rational points 
and Weierstrass points on 3070 cases, and contained points on the curve giving 


'The global height is the absolute logarithmic height of the point, which is the maximum of the 
absolute logarithmic heights of its coordinates. For a rational point 4, this height is computed as 
max (log(|1|), log(|d|)). 
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rise to higher order torsion points on the Jacobian on 17 cases, 13 of which also 
presented non-rational Weierstrass points. In each of these 17 curves, there was 
exactly one pair of points corresponding to higher order torsion points, of order 8 
in ten cases, order 12 in six cases, and order 18 in one case, which we present in 
Example 2. We made these 17 higher torsion examples available on [9]. 

We conclude this section by showing how the algorithm works on a particular 
curve, giving an example where the algorithm detects a torsion point defined over a 
number field, and exhibiting an example curve where the Stoll bound is sharp. 


Example 1 Consider the hyperelliptic curve 
C: y* =x! — 37024x° + 3134464x° — 101220352x* + 1613758464x>— 
— 13656653824x7 + 59055800320x — 103079215104, 
with minimal discriminant 2168084 and conductor 1084042. 
Using Magma, we find that the set of rational points of C with height bounded 
by 10° is 
C(Q)known = {00, (32, 0)}. 


Since C has good reduction modulo 7, we run our code using this prime. The points 
of C(IF7) are as follows: 


{oo, (0, £4), 1, £5), (2, £6), (4, 0), (6, £2)}. 


After Hensel lifting each of these points to a point of C(Q7), we write fo(z), 
fi(z), and f2(z) in local coordinates and find their common zeros. This yields the 
following results: 


Disc Common roots of fo(z), fi(z) and f2(z) 
oo oo 

(0, £4) +4) No common roots 

(i, £5) No common roots 

(2, £6) +6) | No common roots 

4,0) | (32,0) 


(6, £2) +2) | No common roots 


Therefore, we have shown that C(Q) = {o0, (32, 0)}. 
Example 2 Let 


1 24 27 1 1 
Ct ae =F 8 2 x® 2 x 3 a = x? x x 
64 256 4096" 8192 65536 65536 4194304 
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be the hyperelliptic curve with minimal discriminant 6856704 and conductor 
6856704. Using the algorithm described in the paper, we can determine that this 
curve has only one rational point, the point at infinity. However, we also detect a 
pair of 7-adic points with x-coordinate 


66.7 BGs 4G 4 bs F667 467 467" 46.7" 4 00"). 


Using sage’s algebraic_dependency function, we identify this pair of points as 
(—1/8, +/—3/2!!). The points [(—1/8, +/—3/2!!) — oo] have order 18 on the 
Jacobian over Q(./—3). The Chabauty—Coleman algorithm detects these torsion 
points at the prime p = 7 because 7 splits in this field. 


Example 3 When p > 2g is a prime of good reduction andr < g — 1, then Stoll 
[14, Corollary 6.7] has improved the bound given by Coleman to give #X(Q) < 
#X(F,) + 2r. In our case, this shows that #X(Q) < #X(F,). In our database of 
5870 curves, 94 achieve equality for Stoll’s bound. 

For example, the curve 


1 7 1 3 1 1 
y=x’ x aa ane x" + gx + aa 


which has minimal discriminant 4089600 and conductor 170400, has 6 rational 
points of height less than 10°: 


{>> (% zug) (555) (3°) (8:9) -(-75-°)| 


as well as 6 points over F7: 


{oo, (0, 1), (0, 6), C1, 0), (6, 0), (3, 0)}. 
By Stoll’s theorem, we know that there cannot be any more rational points. 
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Abstract We construct curves of each genus g > 2 for which Coleman’s effective 
Chabauty bound is sharp and Coleman’s theorem can be applied to determine 
rational points if the rank condition is satisfied. We give numerous examples of 
genus two and rank one curves for which Coleman’s bound is sharp. Based on one 
of those curves, we construct an example of a curve of genus five whose rational 
points are determined using the descent method together with Coleman’s theorem. 


1 Introduction 


One of the fundamental problems in the arithmetic of curves and abelian varieties 
is to find ways to use p-adic information on curves to determine rational points. 
It is known that p-adic points have a richer structure than rational points, and, in 
principle, are easier to determine. Thus, it is natural to try to use the knowledge of 
p-adic points in studying rational points. There are several such approaches, e.g., 
the Hasse principle and its generalizations, or bounding ranks of elliptic curves by 
Selmer groups. 

The method of Chabauty and Coleman, nowadays also called “abelian 
Chabauty,” is a powerful p-adic method, and it is often very successful in 
determining rational points on curves whose Jaconians satisfy a certain rank 
condition. Coleman’s “effective Chabauty” theorem (Theorem 2) gives an upper 
bound on the number of rational points on these curves. However, it is rare that the 
direct application of Coleman’s theorem succeeds in determining all rational points 
because the bound from Coleman’s theorem is not sharp in most cases. 

In fact, there are only a few curves known for which Coleman’s bound is 
sharp. Hence it is of interest to study the question highlighted by Coleman’s result: 
Regardless of the rank of its Jacobian, when does the number of rational points 
on C meet or exceed Coleman’s bound at p? We will call curves for which the 
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rank condition is satisfied, and for which Coleman’s bound is sharp for some prime 
number p of good reduction, sharp curves at p. The curves such that the number of 
their known rational points meets Coleman’s bound regardless of the rank condition 
will be called potentially sharp curves at p. Note that we cannot use the method 
of Chabauty and Coleman to determine the set C(Q) for potentially sharp curves 
C because we did not check the rank condition. However, if the rank condition 
holds for a potentially sharp curve, then the curve is sharp; hence, we determined its 
rational points. Curves that exceed the bound will be called excessive at p. Curves 
for which there is a prime p such that the curve is sharp at p, potentially sharp at p, 
or excessive at p are called sharp, potentially sharp, and excessive, respectively. We 
note that the rank of the Jacobians of excessive curves is at least g. 

We concisely introduce the method in Sect. 1.1, and the computations of the rank 
of Jacobians of hyperelliptic curves in Sect. 1.2. In Sect. 1.3, we briefly present the 
descent method for determining rational points on curves. The descent method is 
usually used to descend to curves of genus zero and one. Thus, in Sect. 2.3, we 
construct a curve whose rational points are determined by using descent to curves 
of genus two, where one of them is sharp. 

After mentioning two known sharp curves in Sect. 2.1, we give various examples 
of sharp curves of genus two in Sect.2.2, and one with the smallest number of 
rational points in Sect. 2.4. In Sect. 2.5 we list two curves that violate Coleman’s 
bound, and we use this information to determine their ranks. 

In the third section, we start by presenting sharp curves of genus three, four, 
and five in Sect. 3.1. Using results on the existence of primes in short intervals, we 
construct infinitely many potentially sharp curves of each genus g > 2 in Sect. 3.3. 
We give examples of sharp curves of genus g = 4 and g = 5 based on this 
construction. 

All computations were done in Magma [BCP97]. 


1.1 The Method of Chabauty and Coleman and Its History 


In 1922, in [Mor22], Louis J. Mordell proved that the group of rational points of an 
elliptic curve E/Q is finitely generated. At the end of the article, Mordell observes 
that curves of genus g > 2 have only finitely many rational points. The statement 
that for a nice (smooth, projective, and geometrically irreducible) curve C of genus 
g > 2 and a number field K it holds that #C(K) is finite was named the “Mordell 
conjecture”. After Mordell’s theorem it was a natural question whether the group 
A(K) is finitely generated for all abelian varieties A/K, where K is any number 
field. In 1928, André Weil proved in [Wei29] that for all abelian varieties A over the 
number field K we have 


A(K) = Z’ @ A(K tors, 
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where A(K )tors is the torsion subgroup of A(K) and r € Ng is called the rank of 
A(K). 

In the 1940s, Claude Chabauty proved the Mordell conjecture for curves that 
satisfy a certain condition. 


Theorem 1 (Chabauty 1941, [Cha41]) Let C be anice curve of genus g > 2. Let 
J(C) be its Jacobian, and let r be the rank of J(C) over Q. Ifr < g, then C(Q) is 
finite. 


We briefly discuss the idea behind the proof. We may assume that C(Q) # U and 
embed C(Q) via an Abel-Jacobi map into J(C)(Q), and do the same over the field 
Q,, where p is a prime of good reduction for C. That means that C, the curve that 
is obtained by reduction of C modulo p, remains a smooth curve over F,. We have 
the following commutative diagram. 


CQ —> J(C)Q 


| ! 


C(@) —> J(C)(Q) 


The set J(C)(Q,) has the structure of a g-dimensional p-adic manifold, 
containing C(Q,) as a 1-dimensional submanifold. The discrete group J(C)(Q) 
also sits inside J(C)(Q,), and we take its closure in the p-adic topology of 
J(C)(Qp), denoted by J(C)(Q), to make it a submanifold of J(C)(Q,). Then 
the dimension of J(C)(Q) is not greater than r, so heuristically for dimensional 
reasons, it should have finite intersection with C(Q,), which was what Chabauty 
proved. This intersection contains the set C(Q) (more precisely, C(Q) injects into 
it), implying the finiteness of C(Q). 

This theorem remained the most significant result towards the Mordell conjecture 
until Gerd Faltings proved the conjecture unconditionally, i.e. for all nice curves of 
genus g > 2, in [Fal83]. It seemed that Chabauty’s theorem lost its value. But, that is 
not true because Robert Coleman in 1985 found a way to use Chabauty’s approach 
to state and prove an effective version of the theorem. 


Theorem 2 (Coleman 1985, [Col85a]) Let C be a nice curve of genus g > 2. Let 
J(C) be its Jacobian, and let r be the rank of J(C)(Q). Let p be a prime of good 
reduction for C. Ifr < g, and p > 2g, then 


#C(Q) < #C(Fp) + 2g — 2. 


Coleman invented a theory of p-adic integration on curves in [Col85b], which has 
properties that we would expect from integration, such as linearity in integrands and 
additivity in endpoints. When two endpoints are in the same residue disc, i.e., when 
both have the same reduction modulo p, we can compute the integral in the expected 
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way by expressing the integrand as a power series in a local parameter and then 
integrating term by term. There are more important properties, such as Frobenius 
equivariance, and unexpectedly, path-independence, unlike the real and complex 
case. The last two properties turn out to be very convenient, and all properties 
together make Coleman integrals practically computable, which makes Chabauty’s 
method effective. Using Coleman integration, we can construct a locally analytic 
function p : C(Q») —> Q>,, which vanishes on C(Q). We can estimate the 
number of zeros in each residue disc, and summing all estimates gives us the bound 
in Theorem 2. 

We can furthermore investigate the function p in each residue disc (and some- 
times combine results with some other methods) to provably determine C(Q). This 
locally analytic function p is the main ingredient in determining rational points 
on curves, and it is a main object of research nowadays to extend the method 
to the complementary case r > g. This is an explicit version of a non-abelian 
extension of this method, a program initiated by Kim in [Kim05] and [Kim09] when 
r > g. Quadratic Chabauty is possible when r = g, because then, instead of linear 
functionals (integrals), we use quadratic functions (p-adic heights) to determine 
rational points. The first explicit examples can be found in [BBM 16]. 

By a careful investigation inside each residue disc, Michael Stoll in [Sto06] 
proved a theorem that improves Coleman’s bound. 


Theorem 3 (Stoll 2006, [Sto06]) Let C be anice curve of genus g > 2. Let J(C) 
be its Jacobian, and let r be the rank of J(C) over Q. Let p be a prime of good 
reduction for C. Ifr < g —1, and p > 2r + 2, then 


#C(Q) < #C(F,) +: 2r. 


As a corollary, we have the following rank statement for sharp curves. We know 
that r < g. Ifr < g — 1, then Coleman’s bound cannot be sharp. 


Corollary 4 Let C be a sharp curve of genus g > 2. Then the rank of the Jacobian 
of C over Qisr=g-1. 


If a curve satisfying the rank condition is not sharp, this means that looking at the 
local information at p is not sufficient to determine its rational points. Nevertheless, 
it is often possible to compute the rational points by combining the Chabauty 
information at p with v-adic information for other primes v, for instance, obtained 
through the Mordell-Weil sieve. The Mordell-Weil sieve was introduced in [Sch99], 
and well explored in the paper [BS10]. We also note that the Mordell-Weil sieve can 
be successfully combined with the Quadratic Chabauty method, see, e.g., [BBM17]. 

One reference for more details about the method of Chabauty and Coleman 
is [MP12]. The notes [Sik15] contain some very explicit examples, including the 
combination of the method with the Mordell-Weil sieve. 
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1.2. On the Computation of the Rank of the Jacobian of 
Hyperelliptic Curves 


The problem of computing the rank of abelian varieties is extremely difficult in 
general and still open. It is related to one of the most important conjectures in 
arithmetic geometry. This is the Birch and Swinnerton-Dyer conjecture, which states 
that the rank of an abelian variety can be computed analytically, namely, as the order 
of vanishing of its L-function at s = 1 (for more details see, e.g., [Tat66]). The 
latter quantity is called the analytic rank. Thus, if one is willing to assume the Birch 
and Swinnerton-Dyer conjecture, then the rank of the Jacobian of a hyperelliptic 
curve can be computed as an analytic rank. We can compute the L-function of 
hyperelliptic curves in Magma using the algorithm by Dokchitser [Dok04], and we 
can evaluate it at s = 1 as well as its derivatives. We need to be careful that we can 
compute these values only up to some precision. In general, we cannot prove that 
some value is zero, although we can verify that it is very close to zero. 

Algebraically, there has been a lot of progress since Weil’s theorem, and there 
are methods to compute the rank that work in many cases, but none of them is 
guaranteed to work, even for the simplest case of elliptic curves. One of the first 
explicit computations for dimension greater than one appeared in the paper [GG93], 
followed by [Sch95]. It was later generalized and implemented for hyperelliptic 
curves in Magma by Stoll, [Sto01]. 


Let J be the Jacobian of a hyperelliptic curve C : y* = f(x). From the Kummer 


exact sequence 0 —> J[2] — J Ee J —~ 0, and the long exact sequence of 


Galois cohomology, we get the following commutative diagram (Mg is the set of 
places of Q): 


0 — > J(Q)/2J(Q) ———— > H! | J(2]) ——— > H'(Q, J)[2] ——- 0 


| | “s, J 


0— [I 1(@)/27(Q.) — J] #°@., J2])) — J] #°@., D2] 0 


veMo ve Mg veMg 


The kernel of the map @ is the Selmer group Sel (J/Q), and the kernel 


IQ, J) :=ker| H'(Q, J) — [] H'Q,/) 


veMo 


is called the Shafarevich-Tate group. 
We have the following exact sequence from the diagram above 


0 — J(Q)/2J(Q) — Sel (J/Q) — IQ, J)[2] — 0. 
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It is a well-known fact that Sel)(J/Q) is a finite group. The idea of the proof, 
which works for more general Selmer groups and can be found in, e.g., [HS00, 
C.4], is to construct an injection into a finite commutative group indexed by bad and 
infinite primes. In [HSOO, C.4], we also see that the Selmer group is, in principle, 
effectively computable. We have 


dimg, J(Q)/2J(Q =r + dimg, J(Q[2] < dimp, Sel (J/Q). 


The difference between the two sides is exactly dimp, H1(Q, J)[2]. 

We can easily determine J (Q)[2]; it depends on the factorization of f over Q. 
Thus, we can obtain an upper bound on the rank of J(Q). 

For the lower bound, we search for rational points on J, up to some bounded 
height, and hope that eventually, we will find the same number of independent points 
as the upper bound for the rank. If the curve has many rational points, then we might 
be able to find a lower bound on the rank, see the remark in Sect. 2.5. The idea to use 
Coleman’s bound to obtain the lower bound for the rank with the infinite families of 
curves as the examples and the improvements on the lower bound can be found in 
[JT99]. 

From the discussion above, we easily see why the algorithm is not guaranteed to 
work, but it still works very well in practice. We also see that the (partial) knowledge 
of I11(Q, J) can improve upper bounds, e.g., as in [Sto01, Chapter 8]. 


1.3. Introduction to the Descent Method 


The descent method is one of the first methods ever used for determining rational 
points on curves. It is still very applicable, especially for hyperelliptic curves. The 
basics of this method are explained in more detail in [Sto15]. For explicit descent 
on hyperelliptic curves, see [BS09]. The descent method is based on the property of 
unique factorization in Z. In a sense, it is a generalization of the fact that if a product 
of two numbers is a square, then these two numbers have the same squarefree part. 
We can bound the possible greatest common divisor of these numbers, which gives 
only finitely many possibilities for their squarefree parts. In principle, we have a 
strategy to reduce all possible factorizations to a finite number. 

Let us make it more precise. Suppose that C/Q is a hyperelliptic curve given by 
the equation 


C:y = fix) f(x), 


where f| and f2 are non-constant coprime integral polynomials, with at least one 
of deg(f|) and deg(f2) even. Then, for every point (xo, yo) € C(Q), we have 
fi(xo) # 0 or f2(xo) # O, and there is a unique squarefree d € Z, and zo, to € Q, 
such that 
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fixo) = dz5,  fo(xo) = dip. 
Denote by Cg the curve given by equations 
Ca: fi) =z, fox) =dr’; 
then there is a covering 1g : Cg —> C given by 
(x, Z,t) > (x, dzt). 
It follows that 


CQ) =|) maCa), 


deS 


where S Cc Z denotes the set of all squarefree numbers. This set is infinite, but we 
can reduce it to a finite set. Let us assume that f; and /2 are monic polynomials 
(otherwise, we have to include primes that divide their leading coefficients). As 
these polynomials are coprime, their resultant is a non-zero integer R. The resultant 
gives information on possible common roots. So, for all primes p such that p { R, 
the resultant of f, and f2 over F, is not zero, implying that f; and f2 do not have 
a common root in F,. It means that f| and f2 are “coprime at p”, i.e., that it is not 
possible that p divides the squarefree part d. So, the number of possible primes that 
can divide the squarefree part d is finite, and we need to investigate a finite number 
of curves Cy (which, in principle, should be easier than doing so for C) to find all 
rational points on C. 


2 Examples of Sharp Curves of Genus Two 


2.1 Two Known Sharp Curves in Genus Two 


We first mention two known examples. 


Example 1 ([Gra94]) This is the first known example of a sharp curve, so we see 
that it took almost 10 years after Coleman’s paper [Col85a] until the first example 
appeared. Let C be the curve given by the equation 


C:y=xx 1)(« — 2)(x — 5)(x — 6). 


This curve has good reduction at p = 7, and it has eight F7-rational points 


C7) = {(0, 0), (1, 0), (2, 0), (3, £1, (5, 0), (6, 0), oo}. 
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In [GG93], it is computed that J(C)(Q) = Z x (Z/ 2Z)*. We can apply Theorem 2 
to obtain that C(Q) < 8+ 2 = 10. We can find ten rational points, so 


C(Q) = {(0, 0), (1, 0), (2, 0), (3, +6), (5, 0), (6, 0), (10, £120), oo}. 


Example 2 ([HM19]) This example is miraculous because it arises from a geo- 
metric problem. Namely, this curve occurs in analysing the problem whether there 
exists a pair of one right triangle and one isosceles triangle, both with rational sides, 
having the same area and perimeter. More details about the problem can be found 
in [HM19]. The curve C is given by the equation 


Coy Se Heb S32 


We compute that the rank of its Jacobian over Q is equal to r = 1, that C has good 
reduction at p = 5, and that its reduction modulo 5 has exactly eight points. All 
conditions for Theorem 2 are satisfied; thus we conclude 


5 217 
CQ = {0 2), A, 2), (-1,2),(2 ) cou} 


6’ 216 


2.2 A Finite Family of Sharp Curves of Genus Two 


We construct new examples of sharp curves of genus two. Let C be a curve. We 
focus on genus two curves because the computation of the rank becomes more 
difficult for larger genus. Thus, the bound is given by 


#C(Q) < #C(F,) +2. 


Since we want this bound to be sharp, we would like that two residue discs have 
extra points (this condition is natural due to the hyperelliptic involution), and other 
ones have exactly one rational point. The easiest way to achieve this is to require 
a small number of residue discs. We will construct a suitable monic polynomial 
Ff (x) of degree 5. One natural choice for a prime p to consider is p = 11, because 
x> =0,+1 (mod 11). If we want to include other monomials in x, we want them to 
have a coefficient divisible by 11, to control the polynomial f(x) modulo 11 easily. 
It remains to pick the constant term. We want some constant for which we can find 
rational points, but not too many modulo 11. Quadratic residues modulo 11 are in 
the set {0, 1,3, 4,5, 9}, so if we pick the constant term to be 9 modulo 11, we have 
that 


f(x) =x° +9 = {8, 9, 10} (mod 11). 


This implies that 
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CE 11) = {, £3), oo}. 


When the rank of the Jacobian of C over Q is less than two, by Coleman’s bound, 
C can have at most 5 rational points. 

Note, as in [Gra94], that we should check if a curve C has absolutely irreducible 
Jacobian. If this were not the case, then J(C) would be isomorphic to the product 
of two elliptic curves over some number field K, and one of them would have rank 
zero, providing an easier way to determine rational points on C. In [Gra94], David 
Grant together with Jaap Top, proved absolute simplicity of the Jacobian of the 
curve from Example | by proving that its L-function cannot be a product of two 
L-functions of elliptic curves. In the meantime, in [HZ02], Everett W. Howe and 
Hui J. Zhu found a criterion when an abelian variety over a finite field is absolutely 
simple. 


Lemma 5 ((HZ02, Lemma 8]) Let g be a prime power and n > 2 an integer. 
Suppose that m is an ordinary Weil q-number (the minimal polynomial for 1m 
is a characteristic polynomial of Frobenius of an ordinary abelian variety). Let 
K = Q(z), K* its maximal real subfield, andn = [K* : Q]. Suppose that 


(1) the minimal polynomial of m is not of the form x7" + ax" + q", 
(2) the field K* has no proper subfields other than Q, 
(3) the field K™ is not the maximal real subfield of a cyclotomic field. 


Then the isogeny class corresponding to m consists of absolutely simple abelian 
varieties. 


We finally give a number of examples. 


Proposition 6 Let Cy : y7 = f(x) be a hyperelliptic curve over Q, where 


f(x) =x° + Lx? + (11k +3)?, k € {0, 1,2, 3,7, 10, 11, 12, 15, 21, 22, 31, 40, 42, 44, 47, 50} 
or 


f(x) =x° + 1x4 4+ (11k — 3), ke {1,4,9, 15, 16, 17, 19, 27, 28, 31, 40, 41, 42, 43}. 


(1) Then #C;(Q) = 5, and in the first case 


Ce(Q) = {(0, EC 1k + 3)), (-11, £C1k + 3)), ov}, 
whereas in the second case 


Cx(Q) = {(0, EU1k — 3)), (-11, EC1k — 3)), oo}. 


(2) All curves Cx from (1) have absolutely simple Jacobian. 
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Proof 


(1) In both cases, we run the Magma command RankBound to compute an upper 
bound on the rank of the Jacobian of curves C; for 0 < k < 50. We listed above 
all k’s in both cases for which the upper bound is less than two. As we saw, 
p = 11 isa prime of good reduction of all curves Cx and #C,(F11) = 3, so we 
conclude by Theorem 2 that #C;,(Q) < 5. The fact that we have five obvious 
rational points on each C; finishes the proof. All listed curves Cx are sharp and 
have rank r = 1. 

Using Magma and Lemma 5 we check that all J(C,) are absolutely simple 
by finding a prime p such that the reduction of J(C;,) modulo p is absolutely 
simple. 


(2 


wm 


Oo 


2.3 An Example with Descent and a Sharp Curve 


There are many examples of applying descent from curves of higher genus to 
curves of genus zero or one in the literature. Lack of examples of curves whose 
rational points can be determined by applying descent to curves of genus at least 
two motivated us to construct one such curve. 


Proposition 7 Let C/Q be the hyperelliptic curve given by the equation 
C:y? = f(x) = («© + 11x? + 64x + 729) (x7 + 11x4 + 64). 


The set of rational points on C is 


C(Q) = {(0, £216), (—11, +40), oo}. 
Proof As we can see, the polynomial f is monic, so when applying the descent 
method we need to compute the resultant of two polynomials, whose product is f, 
which is 


R := Res(x® + 11x° + 64x + 729, x° + 11x* + 64)) = 3°. 


We are only interested in the radical of the resultant, since it is enough to consider 
squarefree numbers, and rad(R) = 3. Thus, after applying the descent, we know 
that x corresponds to an x-coordinate of a rational point on one of curves 


xt 1x4 4+64=dz*, x©4+11x°4+ 64x +729 = dt? 


for some d € {—3, —1, 1, 3}. 
We first prove that d < 0 is impossible. If x> + 11x++64 < 0, then x < 0 and so 
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x(x? + 11x4 + 64) + 729 > 0, 
giving a contradiction. 
If d = 3, then we can use the linear change of coordinates X = 3x, Y = 33z, 
and then the equation of the curve 
C3: x9 + 11x4 + 64 = 327 
transforms to 
Cy: X°4+33X4+4+ 64-3 = Y?. 
For the curve C3, we compute in Magma that J (C3)(Q) = (0). We know that 
C,(Q) embeds into J(C3)(Q), so C3 has at most one rational point: this is the one 
at infinity. 
If d = 1, then we consider the hyperelliptic curve 
Ci: x0 + 1lx4 +64 = 2, 


which is one of the curves in Proposition 6, and we already know 


Ci (Q) = {(0, £8), (—11, £8), oo}. 


We see that all of these points give rise to the rational points of C, and that 


C(Q) = {(0, £216), (—11, +40), oo}. 


oO 


Comment Letr be the rank of the Jacobian of C over Q. Assuming the Generalized 
Riemann Hypothesis, Magma gives that 2 < r < 4. So, in principle, we would be 
able to apply the abelian Chabauty method to C. However, this seems to be more 
difficult than the approach in the proof, which is furthermore unconditional. As we 
can check in Magma, the Jacobian of C is absolutely simple. 


2.4 A Sharp Curve with the Smallest Possible Number of Points 


We give an example of a sharp curve that satisfies the property that it has the smallest 
number of rational points amongst all sharp curves of genus g > 2. For a sharp curve 
C of genus g > 2 it holds #C(Q) > 3. Indeed, we know that in that case 


#C(Q) = #C(F,) + 2g —2> #C(F,) +2 > 2. 
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Therefore, the set C(Q) is non-empty, implying that also C(F p) is non-empty, 
giving the desired conclusion. If we want to construct such a curve, we want a 
curve of genus two with only one residue disc. If we consider a hyperelliptic curve 
C: y? = f(x), where f is a monic polynomial of degree 5, we already know that 
we have one residue disc at infinity. So, we want to construct a curve having no 
other residue discs, meaning that the reduction over F, can have only the point at 
infinity. We will use a similar strategy as before, so that 


C:y? =x 4+c/Fy. 


If c =7 (mod 11), then C has only one rational point, C(Fi1) = {00} because 6, 
7, and 8 are not quadratic residues modulo 11, and C has good reduction at p = 11. 
There should be two more rational points on C. Let one of them be P = (a,b). 
Then P maps to infinity modulo 11. Furthermore, we know that the denominator of 
a is divisible by 11? since otherwise f (a) is not a square of a rational number. Let 
us try to find an example such that a is already a square of a rational number and 
that a is a zero of a polynomial f(x) — x°, with f satisfying the conditions above. 
Let C be the hyperelliptic curve over Q defined by 


C:y*=x°4121x —4. 


Magma gives us that 7, the rank of the Jacobian of C over Q, is zero or one. We can 
apply Theorem 2, and by the previous discussion, 


4 2° 
C(Q) = {( sr a5) ‘ 20] 5 


We also know that r = 1 in this case by Corollary 4. We could also conclude this 
in another way: Using Magma we see that the torsion subgroup is trivial, which is 
impossible when r = 0 and #C(Q) > 1. 


2.5 Examples on Improving Lower Rank Bounds in Magma 


We now present two examples of excessive curves, for which we can improve 
Magma’s lower bounds for the rank. In both examples, we can determine the rank 
of the curve, although Magma gives inconclusive information. 

Again, it is desirable to have a small number of residue discs, and in particular, 
we would like only one residue disc. This time, we will consider p = 5 and the 
curve 


C : y*=8x°—314x°4+3250x4— 10000x7 +64x = x(2(x —25) (4x —25) (x? — 8x7) +64). 
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The reduction of C modulo 5 is 
C:y* =xBx° +x4-1)/Fs 


We easily see that C has good reduction at p = 5 and that C(Fs) = {(0, 0)}. We 
can find at least five points on this curve: 


{0 0), (>. +20) , (25, +40)| c C(Q), 
so C is excessive. Magma gives us that the rank r of the Jacobian of C satisfies 
0 <r < 2. Hence, we conclude that r = 2. 

Note that we do not find the set C(Q) here because Chabauty’s condition r < g 
is violated. It might be possible to compute C(Q) using Quadratic Chabauty. 

The following example is conditional, i.e., for the rank computations in Magma 
we need to assume the Generalized Riemann Hypothesis. We can do the same for 
the following curve 


Cy? =x° — 12(121x — 1)(121x — 4) 


looking at the good prime p = 11. This curve has at least five rational points, so it 
is excessive. Magma gives us that the rank of its Jacobian over Q is between zero 
and two, thus, it is two. 


Remark We see that a curve C/Q with many rational points (at least 2g — 2) might 
be potentially sharp or excessive. There is an algorithm to check that. We need to 
determine all prime numbers p for which C can be potentially sharp or excessive at 
p. We use the Hasse-Weil bounds and the number of known rational points to give 
an upper bound for primes p, so we need to check finitely many of them. Therefore, 
if we find that the curve is potentially sharp or excessive, we obtain a lower bound 
for the rank, r > g — 1, orr > g, respectively. 


Note that here we only considered primes of good reduction. There are results 
similar to Coleman’s bound when we consider primes of bad reduction, and these 
can be found in [LT02] and improved, with a bound in Stoll’s style, in [KZB13]. 
It would be interesting to investigate the case of bad reduction in the future. In this 
case, we might be able to construct infinite families of examples with sharp bounds. 
We could then allow multiple Weierstrass points in the same residue discs, which 
makes computations of ranks of curves easier and hope that we can determine the 
ranks of Jacobians for an infinite number of curves. 
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3 Examples of Sharp Curves of Higher Genus 


We present a few sharp curves of genus g > 2. There is a way to construct 
potentially sharp curves, which works for any genus g and the construction is 
contained in Sect.3.3, including examples of sharp curves of genus g = 4 and 
g = 5, one of each. In some cases, there is an easier way to find examples of sharp 
curves, and it is presented in the following subsection. 


3.1 Concrete Examples of Sharp Curves of Genus Three, Four 
and Five 


Let g be a positive integer. In the case where 2g + | or 2g + 3 is a prime number, 
we can construct examples of potentially sharp curves in the following ways. 

If 2g +1 = p, where p is some prime number, then x78+! — x = 0 (mod p) for 
all x < F,. If c is any quadratic nonresidue modulo p, then for the curve 


C: y? = x28tl —x+c/Fp 


we have C(F p) = {oo}. So, all possible rational points on the curve C reduce 
modulo p to oo, thus are not integral and have denominators divisible by p. Let 
a\,..., @g—1 be integers not divisible by p, with distinct absolute values. Let b € Z 
be any inverse of (a)... fig)" modulo p. The following curve is potentially sharp: 


Ciytaxtetty b(at ar pees (Cam — p’x)(e— x), 


If 2g +3 = p > 3 is prime, then we can use the property that there are two 
consecutive quadratic nonresidues modulo p. If p = | (mod 4), then between 2 
and p — 2 there are pot quadratic nonresidues implying that two of them must be 
consecutive. If p = 3 (mod 4), —1 and —4 are quadratic nonresidues, so if —2 
or —3 is a quadratic nonresidue, we have the conclusion. If not, then 2 and 3 are 
consecutive quadratic nonresidues. Denote by c and c+ | two consecutive quadratic 
nonresidues modulo p. Then the curve 


Cry? = x78 4 c/F p 
has two F',-points, both at infinity because, for x € Fp, we have that 
x78t? 4 ¢ = {c,c +1} (mod p). 
We construct examples from this curve. Let aj,...,dag—1 be distinct integers not 


divisible by p. Let b € Z be any inverse of a, ---adg_; modulo p. The following 
curve is potentially sharp: 
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C': 4 = x78? 4 b(ay — PX) +++ (Ag—-1 — px)ec. 


We summarize the conclusions above into the theorem. 


Theorem 8 Let g > 2 be a positive integer such that 2g + 1 or 2g + 3 is a prime 
number. Define C and C' as above in each case. If r, the rank of the Jacobian of C 
over Q, satisfies r < g, thenr = g — 1, and 


g+l gtl 
C’@= (2+ ) spce( | ee 
po pt P per 
Proof Follows from the above and Corollary 4. Oo 


We give one example of a sharp curve in each genus g € {3,4,5}. For these 
curves we are able to check the rank condition r < g. However, we need to assume 
the Generalized Riemann Hypothesis for the rank computations in Magma. The 
curves are as follows: 


C3: y? =x! — (49x — 1)(49x — 36)(x + 1), r(J(C3)(Q)) = 2, 


t 21 36 67 
ae {(35:*77) (Fe *57) | 


Cary? =x! (11x — 3) 1x — 4)(11x — 6), r(J(C4)(Q)) = 3, 


3 3 A. we 6 @& 
C. = yak ; Gat ' Mee ,OO“LFS 
4(Q) (+ a) (= =a) (5 a) co. | 


Cs: y> =x)? — (13x — 1)(13x — 2)(13x — 3)(13x — 12), r(J(Cs5)(Q)) = 4, 


C5(Q) 1 1 2 - 26 3 2: 3° 12 12° 
= 3 = 9 ’ 5 3 ] + 3 = ’ CO4 > 
2 ia” 136 13’ 136 13’ 136 13° 136 
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3.2. Bertrand’s Postulate for Primes Modulo 8 


For our construction, we will need a stronger version of Bertrand’s Postulate. We 
formulate one version first, and then we cover smaller cases. 


Proposition 9 Let n > 15 be a positive integer. In the interval [n,2n) there is a 
prime number p such that p =5 (mod 8). 


Proof Let P denote the set of prime numbers in Z. For any coprime integers k and 
1, we define the function 


6(x: k,l) = > log(p). 


peP, p<x, p=l (mod k) 


Theorem 1, for k < 13, from [RR96], states that 


max 
I<y<x 


y 
pO tee Se, 
as =" ote 


where one can take ¢ = 0.00456 for x > 10!°. It implies that when y = x 
A(x: 8,5) — ~| < 0.00456~ < 0.005~ —> 0.995~ < 6(x; 8, 5) < 1.005 
X; 6, Fi SU. a SU. 4 5 a Ss X56, sl. 4 
for x > 10!9. If x > 10!°, then also 
x 2x 
6(x; 8,5) < 1.0057 < 0.995 < 6(2x; 8,5) 


giving that 6(2x; 8,5) > O(x; 8,5) for x > 10!°. It follows that there is a prime 
congruent to 5 modulo 8 between x and 2x. We need to prove the statement for 
the remaining interval [15, 10!°]. We can easily find a list of primes congruent to 5 
modulo 8 such that the next one is larger than half of the previous one (e.g., we can 
use Magma for it), which completes the proof. For instance, we can take the list to 
be 


10000000061, 5000000141, 2500000117, 1250000077, 625000069, 312500077, 156250093, 
78125141, 39062581, 19531381, 9765757, 4882957, 2441573, 1220797, 610429, 305237, 


152629, 76333, 38189, 19141, 9613, 4813, 2437, 1229, 653, 349, 181, 101, 53, 29. 


oO 


Proposition 10 Let n > 2 be a positive integer. Then in the interval [n, 2n) there is 
a prime number p such that p = 3 (mod 8) or p =5 (mod 8). 
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Proof By the previous proposition, we only need to check small cases, but the 
statement is trivially true since 3, 5, 11, 19 are primes that fill in the missing gaps. 
oO 


As a corollary, in each interval [n, 2n), where n > | is a positive integer, there is 
a prime p such that 2 is not a quadratic residue modulo p, which is the property we 
will use in our construction of curves in the following subsection. 


3.3 Potentially Sharp Curves of Genus g > 2 


Let g > 2 be a positive integer. We will construct potentially sharp hyperelliptic 
curves C : y* = f(x) of genus g, where f is a monic polynomial. For simplicity, 
we present the construction and ideas in a simpler way. We add a comment on how 
we can slightly generalize it using the same ideas after Theorem 12 and use these 
generalized ideas to construct concrete examples. 

By our version of Bertrand’s postulate, Proposition 10, there is a prime number 
p, congruent to 3 or 5 modulo 8, that is contained in the interval (2g + 2,49 + 4). 
Define the polynomial 


O(a) = 128+? — yr 4 y28t? st, 


Then we have the following congruence modulo p: 


2x28+2 (mod p), if (=) = 
Q(x) = 42 (mod p), if (=) af, 
P 


1 (mod p),if p |x. 


Since p is chosen so that 2 is quadratic nonresidue modulo p, it follows that 


CE>p) = {(, £1), cos}, 


if C is the hyperelliptic curve over F p» defined by y? = Q(x). 

For each s < g, s € N, we will construct curves C; : y? = H,, whose reduction 
modulo p is C and which will have at least 4 + 2s rational points. Four rational 
points on Cy will be from the set {(0, +1), oo+}, and we need to add 2s points, i.e. 
s values of x for which H, (x) is a square of a rational number. We will construct 
Hy such that new rational points on C; are integral, so that all x-coordinates will be 
integers divisible by p. We note that then C,_) is a potentially sharp curve and Cy, 
excessive. 

Let aj,...,as € Z be distinct and non-zero. We will construct the polynomial 
H, such that H;(pa,), Hs(pa2), ..., Hs(pas) are squares of integers. The strategy 
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is as follows. All monomials x* with k > s will be replaced by 
ts(x*) = x" (ax — pai)(x — paz) ++ (x — pas). 


We extend this transformation linearly so that we can compute f;(P) € Z[x] for all 
polynomials P € Z[x] that have only monomials in x of degree greater than s. This 
transformation has two important properties, namely for all such P € Z[x], we have 


ts(P) = P (mod p), 
ts(P)(O) = t;(P) (pai) = +++ = ts(P)(pas) = 0. 


We can apply this transformation to the higher degree part of Q(x), e.g. to the 
polynomial x78+? — xr (note that pel > g > s). We need to deal with the 
remaining part 1 + x’, where / := 2g + 2 — a If 7 > s, then we can apply ft, to 
x! and define a polynomial 


p—1 
Zs (x) 2= ty(x78t? — x" 4x) 41, 


If s > J, the polynomial 1 + x! will be transformed into a part of a square of some 
polynomial. In this way, we can assure that H;(paj;) is a square fori = 1,...,5. 
Denote m € N the number for which ml < s < (m+ 1)/. We use the following 
lemma. 


Lemma 11 There exist integers cy, C2, ..., Cm such that 
(tix! + cox +++ temx™)? =1+x! (mod p,x°), 


meaning that these two polynomials agree modulo p up to degree s. 


Proof We prove this lemma constructively. We can start by taking c; = pel (or 
any other number congruent modulo p to it). Then, looking at coefficients with x/’, 
where j € {2,...,m} we have 


2c; + S; =0 (mod p), 


where S; is some polynomial in the previous coefficients c), ..., cj-1. Since p is 
odd, 2 is invertible and we can determine c; modulo p. Oo 


If c; are the coefficients from the previous lemma, then 
(Lt cpx! + cx $+ + emx")? = Gs (x) + Ls (x), 
where deg(L;) < s and Ls(x) = 1+x! (mod p), and all monomials in x in G; have 


degree at least s + 1. Note deg(G;) = 2ml < 2s < 2g + 2. Define the polynomial 
Zs as 
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a pete a gS 
Zs (x) = ty(x7#4? — xT — Gy(x)) + G(X) + L(x). 
In both cases (J > s ands > /) foralli = 1,..., 5, we have 
Zs(pai) = Gs(paj) +L s(paj) = (+e1p'aj +02 pa +...+¢mp™ al")? =: BF, 


where, if 1 > s, we have cy) = --- = Cm = 0, i.e. bd; = --- = bs = 1, Z,(0) = 1, 
and 


p—1 
Zs(x) = x28t2 — x2 4.x! 41 = Q(x) (mod p). 
We can change Z; by any polynomial px(x — paj)---(x — pas)R(x), where 
R € Z[x] has degree at most 2g — s, and the new curve will still have all known 
rational points and the same reduction modulo p. So, for any choice as above, define 


Cy: y? =Hy(x), Hy(x) = Zs(x) + px(x — pai) ++ (x — pas)R(x). 


Note that we can take b; = 1 (mod p), so bi 4 0, and we indeed have at least 
4 + 2s rational points on Cs: 


{oo4, (0 +1), (pay, +b), © 838) (pas, xbs)} Cc C;(Q). 


Therefore, we constructed many examples of potentially sharp curves C,_; and 
curves C, of large rank. Let r; be the rank of the Mordell-Weil group of the Jacobian 
of C, over Q. We summarize this in the following theorem. 


Theorem 12 


(1) Lets = g—1, i.e, let Cg_; be a curve defined as above. If rg_, < g, then 
re-1 = g — l, and 


Cg-1(Q) = {o, (0, £1), (pay, £b1), ..., (pag—1, Ebe—1)}. 


(2) Lets = g, i.e., let Cg be a curve defined as above. Then rg = g. 
Proof 


(1) By construction, Cg; is potentially sharp. If the rank condition is satisfied, 
then C,_| is sharp, hence has exactly 2g + 2 points and rank rg_; = g — 1. 
(2) The curve Cg is constructed to be excessive, so rg > g. 
oO 


Comment In the spirit of the ideas in the construction, we can change the curves 
in the following ways. 


(1) The transformation t, can be slightly changed to (k > e; +--- +s) 
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fet junjey (*) = F-48185 (x — pay)®! (x — pan) ++ (x — pas). 


(2) The numbers aj,..., das; do not have to be integers; we can allow them to be 
rational numbers such that aj = am GCD(bj;j,c;) = 1, p | bj for alli = 
1,..., 5. But we need to ensure that the curve still has two points at infinity. 
Note that the resulting polynomial, after a linear change of variables, might not 
be monic anymore. 

(3) We can change anything modulo p, so that the reduction of the curve stays the 
same, as long as we do not change the number of known rational points. For 
example, the constant term of the polynomial can be any square of an integer 
congruent to 1 modulo p, or we can change the polynomials constructed in 
Lemma 11, or increase their degree until it is not greater than g, and similar 
constructions. 


We now want to construct some concrete examples of sharp curves from the 
previous family. To this end we need to check is whether the rank assumption is 
satisfied. In a sense, we want to find curves with many rational points and small 
rank of its Jacobian. 


Remark There is a program started by Noam Elkies, and continued by Michael 
Stoll and Andreas Ktihn dealing with a very similar question: Can we construct 
curves of genus 2 with many rational points whose differences are contained in 
the same arithmetic progression inside the Jacobian? Such curves have small rank 
despite having many rational points. In particular, we can apply the method of 
Chabauty and Coleman for these curves, at least with respect to the subgroup of the 
Jacobian, which contains the arithmetic progression. In many cases, this subgroup 
is a finite index subgroup of the Jacobian. Some examples can be found in [Elk] and 
[Sto]. 


We now look at some of their examples and we determine whether they are sharp. 
In [Elk], we can, e.g., find the family 


C(b) : y? = — x? +bx + (b+ 1))* — 4b(6 + Dx. 


The curve C (b) has rational points 00+, (0, +(b+1)), (1, £1), and (—1, +(2b+1)). 
We denote g := [oo — co_]. Then [(0, —(6 + 1)) — (0,b + 1)] = 5q, [, 1) - 
dd, -1)] = 134, ((—1, 2b + 1) — (-1, —@b + 1)] = 29q. One special curve, for 
b= - 3, has six pairs of points in the arithmetic progression. The curve C(— 3) has 
to more pairs of rational points, (3, +6), and G: +4), and [(3, 6) — (3, —6)] = 6lq, 
[(5, g) — (3, —§)] = 83¢. 

The rank of J(C(-3))@ is equal to 1, and it is almost sharp, since it can be 
proven using the method of Chabauty and Coleman combined with the Mordell- 
Weil sieve that #C(Q) = 12, whereas #C(F7) = 11. 

There is another curve, found by Stoll, C : y7 = (x + 2)(x? — 3x + 6)(4x? + 
8x?—3x-+3), that is sharp and is interesting because it has 13 rational points and still 
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rank 1. In this case, the differences of non-Weierstrass points and their images under 
the hyperelliptic involution do not form an arithmetic progression, but an arithmetic 
progression modulo the torsion subgroup of the Jacobian. 

In the end, we mention the curve from [Sto], C : y? = 25x® + 20x° — 76x4 — 
134x3 + 124x? + 96x +9. This curve is interesting because its Jacobian contains a 
rational point of canonical height ~ 0.000298, which is the smallest known positive 
canonical height on a Jacobian surface over Q. It turns out that this curve is sharp. 


Based on the construction of curves Cg; from Theorem 12, we searched for 
examples of sharp curves using Magma. We were unable to find sharp examples of 
genus two or three. For these genera, the generic rank of the subgroup generated 
by differences of the rational points on the curve appears to be at least g. We give 
one example of a sharp curve of genus four and one example of a sharp curve of 
genus five. For both examples, we assume the Generalized Riemann Hypothesis to 
compute the rank. We did not check the rank conditions of curves of genus at least 
six, since this is an extremely computationally difficult task. 

We start with a curve of genus four, and we choose the prime p = 11. Then 
the reduction modulo 11 is the curve C : y? = x!9 + 1. Since there are no small 
powers of x, we only need to add more rational points using some convenient t¢ 
transformation. We consider the curve 


Cy? =x4(x — 11)? (x — 22)*(« — 33)? +1. 
This curve has the property that the element [oo_ — oo ] of the Jacobian has 


finite order 5, increasing the chances to obtain smaller rank. Indeed, it is verified 
in Magma that this curve has rank 3, so it is a sharp curve of genus four, and 


C(Q) = {oox, (0, £1), (11, £1), (22, £1), 33, +1}. 


We use a similar strategy to construct a sharp curve of genus five. We choose 
p = 13 and start with the reduction curve C : y? = x!* + 1. Then we consider the 
curve 


or, after the linear change of variables 
C: y? = x4(9x? — 169)*(16x7 — 169)? + 1447. 


This curve is potentially sharp, as #C(F13) = 4, and #C(Q) > 12. Hence, the rank 
of its Jacobian satisfies r > 4. 

The group of automorphisms of C contains the group (Z/2Z)*, so by [Pau08, 
3.1.1., Theorem 5], C has two quotients of degree two, C;, and C2, such that J(C) 
is isogenous to the product J(C,) x J(C2). We conclude that the rank of J(C) is 
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equal to the sum of ranks of J(C1) and J(C2). Using Magma, we compute that C; 
and C2 are given by the equations 


Cry? +(x? +x) y=5184x9+4+37944x>—802992.x4— 35809 10x3+35285157x2+326553 15x+7268209, 
C2: y2=—20736x! —55584x°+4461023x°+5168390x*—29428080 1x? —78257 128x2+4786378304.«.+ 
8098228736. 


Furthermore, for both curves C; and C2, Magma gives the lower rank bound 0, and 
the upper bound 2. We conclude that both J(C;) and J(C2) have rank 2, because 
the sum of their ranks is at least 4. So the Jacobian of C has rank 4. Hence the curve 
is sharp and we have 


C(Q) = foes, (0 144, ( “a 12). ( 12). 


We note that it might be possible to use the method of Chabuaty and Coleman to 
find the rational points on C>, and to use that information to find C(Q). However, 
the curve C> is not sharp, so it would be more involved to compute the set C2(Q). 
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Chabauty—Coleman Computations on M®) 
Rank 1 Picard Curves erty 


Sachi Hashimoto and Travis Morrison 


Abstract We provably compute the full set of rational points on 1403 Picard curves 
defined over Q with Jacobians of Mordell—Weil rank | using the Chabauty—Coleman 
method. To carry out this computation, we extend Magma code of Balakrishnan 
and Tuitman for Coleman integration. The new code computes p-adic (Coleman) 
integrals on curves to points defined over number fields where the prime p splits 
completely and implements effective Chabauty for curves whose Jacobians have 
infinite order points that are not the image of a rational point under the Abel—-Jacobi 
map. We discuss several interesting examples of curves where the Chabauty— 
Coleman set contains points defined over number fields. 


1 Introduction 


Let X/Q be a smooth projective curve of genus at least 2. Faltings’s theorem 
[Fal83], proved in 1983, establishes that the set of rational points X(Q) of X is 
finite, but falls short of giving an explicit method for computing this set. Our goal 
in this paper is to compute this set explicitly for certain curves of genus 3. 

Let p be a prime of good reduction for X. We focus on curves amenable to 
Chabauty—Coleman calculations, which use p-adic (Coleman) integrals to carve out 
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a finite set of p-adic points containing the rational points. The Chabauty—Coleman 
method is one attempt at an effective Faltings’s theorem. The method dates back 
to work of Chabauty in 1941 [Cha41], who showed finiteness of X (Q) for certain 
curves X using the embedding of X(Q) into the p-adic closure of J(Q), where 
J is the Jacobian variety of X. In the 1980s, Coleman [Col85a, Col85b] made 
Chabauty’s approach effective with the development of the Coleman integral. With 
this computational tool, he was able to bound the number of rational points on some 
curves. 

For a given curve X, the Chabauty—Coleman method requires that the Mordell— 
Weil rank of its Jacobian be computable and smaller than the genus of X. In this 
paper we apply the Chabauty—Coleman method to genus 3 non-hyperelliptic curves. 
While there exist techniques to compute the rank of Jacobians of general smooth 
plane quartics [BPS16], practical implementations of rank-bounding techniques 
only exist for superelliptic curves, based on work of Poonen and Schaefer [P0097]. 
A Picard curve is a smooth plane quartic X/Q that admits a Galois cover of degree 
3 of P!. Every Picard curve has an affine equation of the form y? = f(x) where 
f(x) € Q[x] is degree 4 and without multiple roots in Q. Therefore we focus on 
Picard curves. While for some particular genus 3 curves X of Mordell—Weil rank 2 
it is possible to determine the rational points using the Chabauty—Coleman method, 
and one can always extend the method using the Mordell—Weil sieve [Sik15, BS10], 
which combines p-adic information for multiple primes p, when the rank is one less 
than the genus we expect the Chabauty—Coleman method alone to fail to determine 
X (Q) for most X. So, in particular, we focus on the rank 1 case, where we can work 
with at least two linearly independent functionals on J(Q,). 

Recently, Balakrishnan and Tuitman [BT20] have made Coleman integration 
explicit and practical, opening up the ability to compute rational points on entire 
databases of curves satisfying Chabauty’s rank hypothesis. 

Sutherland has computed a database of genus 3 curves of bounded discriminant 
[Sut19, Sut], providing the necessary data for large-scale computational experiments 
in genus 3. Balakrishnan—Bianchi—Cantoral-Farfan—Ciperiani—Etropolski [BBCF* ] 
computed rational points on a database of genus 3 hyperelliptic curves with 
Mordell—Weil rank 1 using Chabauty—Coleman methods and de Frutos-Fernandez— 
Hashimoto [dFFH19] computed rational points on genus 3 hyperelliptic curves with 
Mordell—Weil rank 0. This work is the first instance of such computations being 
carried out on non-hyperelliptic curves. 

The Chabauty—Coleman method cuts out a finite set of p-adic points X (Q,)1 on 
acurve X that contains the rational points X (Q); sometimes this set is strictly larger 
than the set of rational points. 

For every Picard curve X we consider, we classify the set of extra points 
X(Qp)1 \ X(Q) in the Chabauty—Coleman set. We show that each extra point P 
is defined over a number field, that is, we can find a number field K where p is 
divisible by a prime p of degree one dividing p, and P € X(K)M X(Ky). Let 
co € X(Q) be the unique rational point at infinity. These extra points fall into one 
of three categories: 


Chabauty—Coleman Computations on Rank | Picard Curves 487 


¢ P € X(K) such that [P — oo] isin J(K)tors3 
¢ P € X(K) such that there exists n € Z so that n[P — oo] is infinite order in 
JQ); 


¢ P € X(K) that appear due to the existence of extra automorphisms of XG: 


We exhibit examples of all three possibilities in Sect. 4.1. 

Our Magma code extending the functionality of Balakrishnan and Tuitman’s 
Coleman integration code as well as our code for computing X(Q))1 on the 
database of Picard curves is available at [HMb, HMa]. 


2 The Chabauty—Coleman Method 


2.1 Chabauty—Coleman 


In this section we introduce the Coleman integral as a p-adic tool for computing 
the rational points on a curve. We follow the exposition of the construction of the 
Coleman integral in [Wet] and [MP12]. Let X be a smooth projective curve of genus 
g > 1 over Q with Jacobian J. Let B be a nonzero rational divisor on X inducing 
an Abel—Jacobi map 


wxXodJd 
Q +> [deg(B)Q — B] 


(for example we can always take a canonical divisor or we can take a rational point if 
one is known). Let p be a prime of good reduction for X. We introduce two ways to 
construct the p-adic Coleman integral of a regular one-form on XQq,,: one by pulling 
back an integral on Jg, thought of as a p-adic Lie group and the other directly on 
XqQ, as a rigid analytic space. We first describe the p-adic Lie group approach and 
do not address the equivalence here; Coleman discusses the details in [Col85b]. 
Let wm € H°(Ja,» Q!). Since J(Qp) is a p-adic Lie group, the differential w 
is translation-invariant, and we can define an antiderivative homomorphism iF : 
J(Qp) + Qyp. This homomorphism sends a point Q € J(Q,) to i w. For any 
open subgroup U C J(Q,), the antiderivative of w can be computed by expanding 
@ in local coordinates at Q € U and taking formal antiderivative in the power series 
ring; this yields a convergent p-adic power series on small enough open subgroups. 
The pullback map 


*: H°(Jg,, 2') > H°(XQ,, 2!) 


induced by 1 is an isomorphism, independent of the choice of B [Mil86, Propo- 
sition 2.2]. This gives us a way to think of the integral on the curve Xq,: for any 


WE H°(XQ,, Q!) and any P, 0 € X (Q,,), we can define i w by [ee a. 
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Conversely, we can write an integral on Jg, in terms of integrals on XqQ,: for any 
D € J(Q,) and @ € H°(Ja,» Q!), we can write D = >7;[Gi — Pi] where 
P; € X(L) for L/Q, a finite extension. Then 


D Qi 
o= “o. 


Note i @ = Owhen D € J(Q»p)tors because the antiderivative is a homomorphism. 


Coleman gives a definition of the integral : 7 on the rigid analytic space XQ,,; 
to do so, we work over a wide open space V C XQ, constructed by deleting a finite 
number of closed disks from XQ, of radius less than 1. He defines an integral for 
P,Q € V and n any differential w of the second kind on V [Col85b] (a differential 
@ is of the second kind if it is everywhere meromorphic and the residue at every pole 
is zero). Coleman shows his definition agrees with the one induced by the p-adic 
Lie group structure of Jg, when 7 € H °(Xq,, Q!). Fix an algebraic closure Q, 


of Q,. For all points P, 0, R € VQ,) and one-forms w, 7 of the second kind on 
XQ,» the Coleman integral enjoys the following properties: 


Theorem 2.1 (Coleman) 


(1) Linearity: [Pot Bn) = a foot BSP n, for alla, B € Qy. 

(2) Additivity: [2 o = [Rot fo. 

(3) Fundamental theorem of calculus: i df = f(Q) — f(P) for df a rigid 
analytic function on V. 

(4) Change of variables: if V' C X’ is another wide open subspace of a rigid 
analytic space X', for any rigid analytic map @: V — V', we have hid o= 
IP ¢*(). 7 

(5) For any divisor D = >); Qi — P; of degree zero on X defined over Q,, and for 
a regular one-form, then [7 @ := ); {p,' is well-defined, and [pw = 0 
when D is principal. 

(6) Galois equivariance: if o € Gal(Q,,/Qp), then (iP w)" = ey a. 


Remark 2.2 Let Q;, P; € Q, such that D := )°; O; — P; is a divisor of degree 
zero on X defined over Q,. Let w € H°(XQ,, Q!). By (5) and (6) of Theorem 2.1, 


the integral 
Qi 
o= a) 
aos 


is well-defined, only depends on [D], and belongs to Q,, even though the individual 
integrals in the sum - w may not belong to Qp. 
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When the rank r of J(Q) is less than the genus g of X, the space of vanishing 
differentials 


Van(X (Q)) = {o € H°(Xg,, 2’): / o = 0 forall D € 1@)| 
D 


is at least (g — r)-dimensional, and thus nonzero. In particular, when r = 1, the 
space of vanishing differentials is exactly (g — 1)-dimensional: for D € J(Q) the 
integrals i q@; on a basis of regular differentials i = 1,..., g are zero if and only 
if D is torsion [Bou89, III, §7.6, Proposition 12]. Since we are guaranteed some 
D € J(Q) which is non-torsion, Van(X (Q)) cannot be all of H°(Xq,, Q!). 


Remark 2.3 We will think of the p-adic integral of wm in Van(X(Q)) as the 
Coleman integral [ p® = vi Fe P, w taking place directly on the curve Xq,, as 
a rigid analytic space (as in Theorem 2.1 part 6). While we could consider both the 
p-adic Lie group integral je (*)~!w on the Jacobian and the Coleman integral 
when wo € H °(XQ,, Q!), in the process of explicitly computing the Coleman 
integral between two points in different residue disks, we compute integrals of 
differentials of the second kind that are not everywhere regular (see Sect. 2.2). 
Therefore throughout the paper we consider all p-adic integrals to take place on 
the curve as a rigid analytic space. 


Coleman [Col85a] proves a bound on the number of rational points #X(Q) < 
#X(F,) + 2g — 2 by bounding the number of zeros of a nonzero differential in 
Van(X (Q)), assuming p > 2g. 

Coleman’s proof suggests an algorithm for computing X (Q) by computing the 
set 


deg(D)Q 


X(Qp)1 = {2 e X(Q,): w = 0 for all w € Van(X(Q))}. 
B 


Since the Coleman integral is locally analytic, it can be expressed as the integral of 
a convergent p-adic power series on each residue disk. Computing X (Qp)1 yields 
finitely many solutions in each disk, and so finitely many in total. Therefore X (Q) € 
X (Qp)1 is finite. 


2.2 Explicit Coleman Integration After Balakrishnan and 
Tuitman 


To carry out the Chabauty—Coleman method in practice, we need an explicit method 
for computing the set X(Q,)1, which is defined as the vanishing locus of some 
p-adic integrals. In this section, we give an overview of a practical algorithm to 
compute p-adic integrals on curves, following the method of Balakrishnan and 
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Tuitman [BT20]. For a more detailed description of the algorithm, which applies 
in more generality than described here, we refer the reader to their paper, as well as 
the zeta function algorithm of Tuitman described in [Tuil6, Tuil7] which develops 
the reduction in cohomology, extending an earlier algorithm for hyperelliptic curves 
due to Kedlaya [Ked01, Ked03]. 

Let X/Q be a smooth projective geometrically integral curve with (possibly 
singular) affine plane model given by the equation Q(x, y) = O with Q(x, y) € 
Z{x, y] and let p be a prime of good reduction. Tuitman’s algorithm exploits the 
existence of a low-degree map x : X — P! of degree dy. After removing the 
ramification locus of this map, the algorithm chooses a lift of Frobenius which sends 
x + x? and Hensel lifts y. Then, we compute the action of Frobenius on a certain 
basis of differentials whose classes generate He Z (XQ,), and reduce using Lauder’s 
fibration method which is given by explicit and fast linear algebra. 

To specify a point P on X, Balakrishnan and Tuitman’s algorithm uses the 
following data: 


¢ P*‘x the x-coordinate of P (or 0 if x is infinite). 

* P*b the values of b° = [ep er bi ,] that form an integral basis for the 
function field Q(X)/Q[x] at x, (bD° = [b>...., be] an integral basis for 
Q(X)/QU1/x] if P is infinite). 

e P*‘inf a boolean value specifying whether P is infinite. 


We must impose some additional assumptions on Q(x, y), p, b°, and b©, which 
are listed in [BT20, Assumption 2.6], in order to compute p-adic Coleman integrals 
on Q(x, y) = O using Balakrishnan and Tuitman’s algorithm. These hypotheses can 
be verified directly for our case of Picard curves from the computation of b° and b® 
provided here. 

In the case where X is a monic Picard curve y> = f(x), the map x : X > 
P! is projection to the x-coordinate, b° = [1, y, y*], b© = Ly, y/x?, y/x>]. The 
regular differentials w) = ydx/f,w. = xydx/f and 3 = y*dx/f form a basis 
of H°(Xq,, 2!). We extend this basis by w4 = x? ydx/f,@s = xy*dx/f,o6 = 
gy nsf to obtain classes [@ 1], ..., [we] that generate He (XQ,)- 

The p-adic points X(Q,») are a disjoint union of residue disks, which are the 
preimages under reduction modulo p of points in X(F,). After Balakrishnan and 
Tuitman, we classify the residue disks based on their ramification with respect to the 
map x : X > P!. Let A y(x) be the discriminant of Q thought of as a polynomial 
in y, and r(x) be the squarefree polynomial with the same zeros as Ay(x) given by 


r(x) = Ay(x)/ gcd(Ay(x), — ). Then our residue disks partition as follows: 


¢ Bad residue disks are residue disks that contain a point whose x-coordinate is a 
zero of r(x) or is infinite. Such points are called very bad points. If the very bad 
point of a disk has infinite x-coordinate, the disk is also an infinite residue disk, 
and the point is also a very infinite point. Otherwise the disk is a finite residue disk. 

* Good residue disks comprise the remaining residue disks. All good residue disks 
are finite. 
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Note that each residue disk contains at most one very bad point P = (x, y) that 
will be defined over an unramified extension of Q, [BT20, Remark 2.9]. We denote 
by ep the ramification index of the map by x at a very bad point P. If Q(x, y) = 
y3 — f(x) is a Picard curve, then r(x) = —27/f(x), the very bad points are the 
ramification points (where y = 0), the point at infinity oo is [0 : 1 : 0], the residue 
disks containing ramification points are the finite bad disks, and all very bad points 
have ramification index 3. 

There are two kinds of Coleman integrals on X: integrals between two points P 
and Q in X (Q,) in the same residue disk, called tiny Coleman integrals and integrals 
between two points that are not in the same residue disk. Tiny integrals are computed 
using uniformizing parameters. A uniformizing parameter gives an isomorphism 
from a residue disk to pZ, and transforms a Coleman integral into a power series 
integral. 


Algorithm 2.4 Compute a tiny integral [ 4 @ and uniformizing parameter t 
Input: P and Q € X(Q,) in the same residue disk, and w a differential of the 
second kind on X such that w does not have a pole in the residue disk of P and Q. 
Output: ie w and t a uniformizing parameter in the residue disk of P and Q 


(1) If P, Q are ina bad disk, find P’, the very bad point in the disk, and break up 


the integral as i a+ i w. Otherwise proceed with P = P’. 

(2) Compute a uniformizing parameter t at P': in a good disk, t is given by x — 
x(P’); at avery bad point t = be a b?(P’) for some i (replace with b?° for very 
infinite P’). 

(3) Compute x and the b° vector (or b©) as a function of t to desired precision, 
and using these expansions compute w as a power series in t. 


(4) Integrate 
Q t(Q) 
i 0) =a w(t) 
PB t(P) 


For more details on the computations of the tiny integral and the uniformizing 
parameters see [BT20, Algorithm 3.7, Proposition 3.2]. 

For the second kind of Coleman integral, between points in different residue 
disks, we turn to the full machinery of Tuitman’s zeta function algorithm to compute 
the action of Frobenius on cohomology. Let P and Q in X (Q,) be points in different 
good residue disks. We use Tuitman’s lift @ of the Frobenius morphism on Xf,,. 


as a power series. 


Let w; be Tuitman’s basis differentials such that [w;] generate He _ (XQ,) fori = 
1,...,2g. By applying Frobenius to P and Q, we obtain 


Q o(P) $(Q) Q G(P) Q Q 
i w= | ot f otf w= | oi f oro + | Oo} 
- P $(P) $(0) P P $(0) 
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where P and @(P) are in the same residue disk, so define a tiny integral, and 
similarly for Q and ¢(Q). 

So, we expressed the unknown integral as a linear combination of two com- 
putable quantities and the integral of #*w;. This motivates computing $*a,j; 
Balakrishnan and Tuitman compute the reduction in cohomology of ¢*@; as a 
sum of basis elements a M;j@; plus an exact differential df;. We compute the 
reductions of all 6*@; and consider the linear system of equations to compute the 
Coleman integral. For more details see [BT20, Algorithm 3.7, Algorithm 3.12] 
as well as [Tuil7, Section 4.1] ila discusses how to construct the 2g basis 
differentials whose classes generate H,, ‘ig (XQp)- If P lies in a bad disk, some f; 
might not converge at P, but near the boundary of the disk of P they do; we compute 
the integral [ e w by taking S € X(Qp)( p'/°)) on the boundary for e large enough 
and breaking up the integral as [ : otf - @ 

In practice, to compute X (Q),)1, we integrate to arbitrary endpoints 7 w; to do 
this we iterate over residue disks, breaking up the integral by fixing P in each residue 
disk and integrating /, i w+ [> @ where f; fi sets the constant of integration and 
i fs @ is a tiny integral which can be expanded as a p-adic power series to arbitrary 
precision and whose zeros can be found using analytic techniques. Balakrishnan 
and Tuitman implement the Coleman integral as well as the computation of the 
Chabauty—Coleman set X (Q,)1 in Magma [BT] under the assumption that we can 
find rational points P,, P2,..., P» € X(Q) such that the classes [ P; — b] are infinite 
order and linearly independent in J(Q), where r is the rank of J(Q). However, 
their implementation does not check the precision (for example, when solving for 
roots of p-adic power series the code does not check whether all of the roots 
obtained are single roots) and needs a few modifications to produce provably correct 
results. 


3 The Algorithm 


In this section, we describe the algorithm we implemented to compute X (Q), for 
each X arank | Picard curve in our database. These curves come from Sutherland’s 
database of genus 3 curves of small discriminant [Sut19, Sut] and have discriminant 
A bounded by 10!. See [Sut19] for a definition of the discriminant of a smooth 
plane curve; in particular, p|A if and only if p is a prime of bad reduction. Each 
curve in the database is given by the data of an affine model y> = f(x) with f(x) 
degree four, along with its discriminant A. 

We began by filtering the database for those curves whose Jacobians have 
Mordell—Weil rank 1. We ran the Magma function RankBounds (f,3:Return 
Generators:=true) which computes the rank based on work of Poonen and 
Schaefer [Poo97] and Creutz [Cre13] and is implemented in Magma by Creutz. It 
gives a lower and upper bound for the rank of the Jacobian of the cyclic cover of P! 
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defined by y? = f(x) along with a list of polynomials g(7) which define divisors 
div(g) on P! that lift to rational divisors on X. The image of such a divisor in the 
Jacobian is either a 3-torsion point or a point of infinite order. 

If RankBounds returns upper and lower bounds that prove J is rank 1, we 
proceed. Otherwise, we discard the curve and move on to the next one.' For J 
rank 1, we consider the non-torsion generators g(T); if there is some non-torsion 
g(T) of degree one, then we have a rational point P € X(Q) such that [P — oo] 
is infinite order in J(Q). Otherwise, we are left with g(T) of higher degree, 
which correspond to rational divisors that are supported on Galois conjugate points 
defined over a number field K. To handle this second case, which comprises 544 
of the 1403 Picard curves in our rank | database, we extended functionality in 
Balakrishnan and Tuitman’s code to handle rational divisors on curves when running 
effective Chabauty computations. This is done by extending the Coleman integral 
functionality to compute integrals to points on curves defined over number fields 
when p is totally split in the field. 


3.1 With a Rational Point Whose Class in the Jacobian Is of 
Infinite Order 


Algorithm 3.1 Computing X(Q))1 with P € X(Q) such that [P — 00] is infinite 
order 

Input: a Picard curve X with affine equation y? = f(x) such that df is monic, 
with discriminant A, and Jacobian J such that J(Q) is rank 1, along with a point 
P € X(Q) such that [P — co] € J(Q) has infinite order, and parameters (N, e), 
where p is the p-adic precision and Qp(p'’) is the totally ramified extension 
over which we work in the bad disks (see Sect. 2.2). 

Output: S © X(Q) and T © X(Qp,)1 \ S or “Failure”. If the algorithm does not 
return “Failure”, and T = @, then S = X(Q). 


(1) Find the first? prime p > 3 such that p + A. (We discard p = 3 because 3 
always divides the discriminant of a Picard curve [BBW17, Theorem 1.1]. We 
avoid p = 2 for simplicity when Hensel lifting.) 

(2) Compute a list of points S © X(Q) by searching in a box up to naive height 
1000. 


'For practical purposes, we terminated RankBounds if it ran for 120s without returning an answer. 
We discarded 520 curves from the total 5335 curves in the database provided by Sutherland either 
from terminating RankBounds or due to a “Runtime error”. 


For some heuristics for choosing e given a fixed N see Appendix 4.1. 


3Due to a minor bug in the code of Balakrishnan and Tuitman for computing local coordinates 
at very infinite points, which in certain cases yields a Runtime Error instead of computing the 
coordinates, we sometimes choose the next largest possible prime. 
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(3) Compute a basis v1, v2 of vanishing differentials Van(X (Q)) by computing a 
basis for the kernel of the matrix of integrals os aj] fori = 1,2,3 regular 
basis differentials. (We compute v1, v2 to finite precision: let M be the matrix 
of Frobenius and I the identity matrix. Then v1, v2 are computed to precision 
N — ord,(det(M — I)) — 6 as described in [BT20, Section 4] where 6 is a 
precision loss defined in [Tuil17, Definition 4.4]. If N — ordp(det(M — I))—6 < 
0, return “Failure”’.) 

(4) Solve for X(Qp)1 = {2 € X(Qp)| ie vj = 0,7 = 1, 2} using Algorithm 3.7. 
If Algorithm 3.7 returns “Failure” (due to a bad choice of parameters N and 
e), return “Failure”. 

(5) LetT = X(Qpy)1 \S. 

(6) Run the Magma function PowerRelation to see if we can recognize the 
points of T as defined over a number field. If any point Q of T is defined over 
Q, append it to S and remove it from T. 


i. First check if f(x(Q)) = 0; if so, Q is a ramification point, and therefore 
[Q — ov] is 3-torsion. Remove Q from T. 

ii. Compute {2 aj,i = 1, 2,3 on regular basis differentials. If these are 0 to our 
p-adic precision, then [Q — ov] is potentially torsion. We verify it is torsion 
by finding n € Z such that n|Q — co] = 0 € J(Q) using Jacobian arithmetic 
implemented in Magma. If we verify it is torsion, remove Q from T. 

iii. Search in a box for relations with coefficients of small height between [Q — 
oo] and known rational divisors on J(Q). If we succeed, remove Q from T. 
We have explained Q € X (Q)p)1 by linearity. 


(7) Return S and T. If T = @, then S = X(Q). IfT GY, then more work must be 
done to explain the presence of the remaining points in T C X(Qp)1 \ S and 
provably determine X (Q). 


3.2 Precision Bounds 


In Algorithm 3.1 Step 4 we solve for the zeros of the integral of a regular differential 
v on X. To do this, we solve for the zeros in each residue disk of Q € X(Fp), 


: v(t)dt for some QO € X(Q,) reducing 


expanding the integral fv = e v+ ie 
to Q, and where f is a uniformizing parameter at Q computed using Algorithm 2.4. 
In practice, we must truncate the p-adic power series to be a polynomial with finite 
p-adic precision. Therefore we need to determine how much p-adic and t-adic 


precision we need to determine all of the zeros of fig u+ is u(t)dt on the residue 


disk of OQ. 
First we define a normalizing process for p-adic power series. 
Algorithm 3.2 Normalize a p-adic power series f 
Input: f(t) € Qpl[l[t]], a p-adic power series such that f'(t) = Vizo ajt' € 
Zyl[t]], and df has constant term inc € Zp. 
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Output: F(x) € Zp[[x]] \ pZp[[x]] such that the roots of f (t) are in bijection with 
those of F by the map r +> pr. 


(1) Set Fo(x) = f (px). Then Fo(x) =c+ Djs ev 


(2) Set i.:= min ({»» (424)},,, Ubon), 
(3) Return F(x) := Fo(x)/p. 


First, it is easy to see F(x) is in Zp[[x]]. For i > 0, the coefficient of x! is 
i i-h 
oP which has positive valuation for all i > 0 (by assumption the constant term 


has nonnegative valuation). In Step 2, we ensure that F(x) € Zp[[x]] \ pZp[[x]]. 


Lemma 3.3 Let f(t) € Q,[[t]] be a p-adic power series such that f'(t) € Zp[[t]].- 
Write f'(t) = ay + ayt + ant? +... and f(t) = c+ aot +ajt?/2+apt7/34---. 
Let F (x) be the output of Algorithm 3.2 on input f(t), so that 


oo ax! pi 
F(x) = > ee eo 


i 
i=l 


where we assume c € Zp. 
Fix a positive integer N € N. Define 


M(N, A) := min{m — 4 — log, (m) > N} 
m 


and let M = M(N, A). Write F(x) = Fy (x) + Foo(x) where Fy (x) contains the 
terms F of degree at most M, and F5(x) contains higher order terms. 
Then Fy (x) = F(x) mod DN. 


Proof Let bj := ey be the coefficient of x!. 
To show Fy (x) = F(x) mod p”, we show that fori > M, vp(b;) > N. Then 
Vp (bi) = vps) >i-A— log, (@), and fori > M, this is greaterthan N. O 


Note that if c € Qp and vp(c) < 0, then F(x) and therefore f(t) has no zeros, 
so we will not need to consider this case. 


Remark 3.4 In order to work with the prime p = 5 < 2g, we provide a more 
detailed analysis of the precision here than in [BBCF*, Section 3.3]. Their algorithm 
assumes p > 2g in order to obtain explicit formulas for the p-adic and t-adic 
precision. Let b € X(Q). 


Definition 3.5 Let N be a positive integer G € Z,[[x]]. Let G denote the reduction 
of G to Z/pZ. We say G satisfies Hensel’s lemma for r € Z/p%Z if 

* G(r) =0 modulo py and 

° Gr? #0 modulo p’. 
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If G satisfies Hensel’s lemma for r then by Hensel’s Lemma [Con, Theorem 4.1] 
there exists a unique 7 € Z, such that G(r) = O and |r —7r|p < IG (r)|p- 
Furthermore, 


1. |r-—Frlp= IG(r) | p/IE |p and 
2. |G Pip = IE Op. 
Lemma 3.6 Let F(x) be a normalized power series, obtained as output from 
Algorithm 3.2, which converges on Zp. Fix N a positive integer, M = M(N,)d), 
and write F(x) = Fy (x) + Foo(x) as in Lemma 3.3. 

Letr € Z/pNZ such that F(x) satisfies Hensel’s lemma for r. Let? € Zp be the 
unique lift of r satisfying |r —F|p < |Fy()|p- 

Then? =r mod pr Fu), 


Proof Since F satisfies Hensel’s lemma for r € Z/pNZ, we have Fy(r) = 0 
modulo p™. Furthermore, by (1), letting 7 be the unique lift of r satisfying 
IFu()|p > Ir — Fp, we have 


pNP) > |Fu@)|p/|\Fu@)|p = lr —Flp- 


Sor =r modulo pN—(Fy)), Oo 


Every root of F(x) reduces to a root of Fy(x). Conversely, suppose we start 
with a set of roots R in Z/p Z satisfying Hensel’s Lemma for Fy, so that for any 
r,r’ € R, we have 


r&r’ mod pX-%Fu®), (4) 


Then Lemma 3.6 implies that the roots r € R lift uniquely to simple roots 7 of F in 
Zp if Fy satisfies Hensel’s lemma for all r € R. 

We call a set R with property (+) a system of roots for Fy modulo p%. We 
can compute a system of roots iteratively, lifting roots of Fjy modulo p to higher 
precision, as described in Algorithm B.0.1. 


Algorithm 3.7 Solving for the zeros of a tiny integral in a residue disk 


Input: b € X(Q), w € Van(X(Q)), Q € X(Q,) with uniformizer t computed as in 
Algorithm 2.4, and a precision N. 

Output: The zeros in the residue disk of Q of the antiderivative of w with constant 
term iia a, or “Failure”. 


(1) Compute c = ie @ the constant term (if Q is a bad point, c = vis oO+ {a o 
where S € X(Qp(p!*)) is a point near the boundary of the residue disk of Q 
with e large enough so that the integral from b to S converges). 

(2) Compute f'(t) := w(t) to precision N using Algorithm 2.4. 
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(3) Compute f(t) the antiderivative of f'(t) with constant term c, accurate to 
precision N' = N — 6, where 6 is the precision loss when integrating discussed 
in [BT20, Section 4]. If N’ < 0, return “Failure”. 

(4) Compute M(N’, i) and let Fy (x) be as in Lemma 3.3. 

(5) Compute a system of roots R of Fy(x) modulo pN, as described in Algo- 
rithm B.0.1. 

(6) For eachr € R, check if Fy, (r)* # 0 modulo pn’. If this is true for every r, 
these correspond uniquely to the zeros of f(t), as per Lemmas 3.3 and 3.6. If 
not, return “Failure”. 


Remark 3.8 If f(t) has double roots, then Fyy(x) always has double roots. In 
Algorithm 3.1, we solve for common roots of two power series; we check the roots 
are single roots of at least one of the truncations. In the 1403 Picard curves in our 
database, at VN = 15 we found common double roots for a single curve. By using a 
larger prime, we were able to provably compute that curve’s rational points. 


Remark 3.9 According to [BT20, Remark 4.2] the precision M in Balakrishnan 
and Tuitman’s implementation used to truncate F (x) t-adically is chosen so that for 
alli => M, vp(bj) = N for b; the coefficient of x!. The chosen M > M(A, N’) is 
sufficient. 


3.3 Beyond Rational Points 


If we cannot find P € X(Q) such that [P — co] is infinite order in J(Q), then 
we are must instead use a rational divisor on X whose image is infinite order in 
J (Q). Because X is a genus 3 curve with a rational basepoint co € X (Q), we have 
a surjection Sym? (X) —» J [Mil86]. Because we assume the rank of J(Q) is 1, 
such a divisor exists, and the points in the support of the divisor are defined over an 
extension of degree at most 3. 

Using our extension of Balakrishnan and Tuitman’s code, it is possible to 
integrate over any rational divisor of the form D — deg(D)b € X(Q) for rational 
primes p split completely in L, the smallest number field over which the points 
in the support of D are defined, and also to points defined over a number field K 
on a curve X/Q for p completely split in K. The curve X/Q should be smooth 
projective and geometrically integral with (not necessarily smooth) affine model 
Q(x, y) satisfying the same assumptions as in [BT20, Assumption 2.6]. However, 
we focus our discussion on the situation for superelliptic curves where it is practical 
to compute rank bounds. Our code is tailored to this case. For non-hyperelliptic 
superelliptic curves, when computing infinite order points in the Mordell—Weil 
group J(Q), RankBounds outputs polynomials* g(T) such that the divisor div(g) 


“For a given curve there are often multiple choices of g, and we pick the one with the smallest first 
split prime. 
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on P! lifts to a degree deg(g(T)) rational divisor on X under the map X —> P!. Then 
deg(g(T)) is at most the genus of X, which is 3 in our case of Picard curves. (For 
odd degree hyperelliptic curves RankBounds returns the Mumford representation 
of the divisor, and one can recover g by taking the polynomial for the x-coordinate. 
Our code does not handle the even degree hyperelliptic case.) 

Let D be the degree d = deg(g(T)) rational divisor obtained by lifting div(g) to 
X. Then D = P,+---+Py—doo. We use L to denote the splitting field of g, which 
is the smallest number field over which the points P; are defined. To compute the 
Chabauty—Coleman set for X, we need to compute the set of vanishing differentials, 
which is done by taking a basis for the kernel of the | x 3 matrix of integrals of 
the regular differentials over D. The integral [ pi 1s equal to ae ie Bl a;. By 
choosing a prime p that splits completely in L, for any p|p in L we have Ly = Qp. 
Therefore, we can proceed as before and compute these integrals in Ly. 

There are no theoretical innovations needed extend the code to use inert primes 
p in L, but it would require extensive changes to the base code. To integrate using 
inert primes, one would need to implement the g-power Frobenius for residue 
fields IF, where qg is a power of p. The code directly extending Balakrishnan 
and Tuitman’s code for Coleman integrals and the code for batch computations 
of rational points and testing the algebricity of points and Jacobian relations are 
available on GitHub [HMb, HMa]. 


4 The Data 


In this section we discuss the overall data and give in-depth explanations on 
examples of interest. For the rest of the section, we fix co € X(Q) as our basepoint 
for the Abel-Jacobi embedding. 

We computed the Chabauty—Coleman sets X (Q,)1 of 1403 rank | Picard curves 
of discriminant bounded by 10!? and provably determined the set of rational points 
on these curves. In 859 cases, we found a rational point P € X(Q) whose class 
[P — co] under the Abel—Jacobi map is infinite order. For the other 544 curves, we 
used infinite order divisor classes not given by the image of a rational point under 
the Abel-Jacobi map.° 

On all curves, the naive search for points of height at most 1000 yielded all of 
the rational points. In Fig. 1 we display the number of Picard curves with n rational 
points. Every Picard curve has exactly one point at infinity and it is rational; the 
curves in our database had at most six rational points. In Fig. 2 we show the primes 


>For the remaining 544 curves, it can be quite computationally expensive to compute X(Q) 
depending on the parameters N, e, and p. Individual curves can require up to several hours. This 
computation was run on a single core of a 28-core 2.2 GHz Intel 2 Xeon Gold server with 256 GB 
RAM. 
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Fig. 2. Primes used to determine X(Q,); for deg(g) > 1 


used for the curves with divisors not given by the image of a rational point under the 
Abel—Jacobi map. These are the first split primes in the splitting field of g, where 
the divisor is the rational lift of div(g), and g is chosen from a list of possible g 
returned by RankBounds. For ease of reading, we leave out the primes used once: 
those primes are 43, 61, 83, and 107. 

For all but 14 curves, the only extra points in the Chabauty—Coleman set were 
the ramification points whose image in J is 3-torsion. For six curves, we found 
points Piop € X(K) such that [Piop — 00] € J(K) is a torsion point of higher order 
defined over a number field K. These torsion points were 9-torsion and 4-torsion; 
the corresponding points Pio, are all of the form (a, b'/3) with a,b € Z. For five 
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curves, we found extra points Pji, € X (K) defined over a number field K such that 
there exists n € Z so that n[ Piin — 00] is infinite order in J(Q). For three curves, we 
found an extra point in the Chabauty—Coleman set whose presence is explained by 
an automorphism of the curve. 


4.1 Examples 


For some Picard curves X and choices of prime p, we found interesting global 
points in X(Q,); that were not ramification points or rational points. We discuss 
four notable examples here. 


Example 4.3 Consider the curve X : y> = x4 + 6x? — 48x — 64 with discriminant 
1289945088. Then J has the infinite order point [(—3, — 1) — co] so we compute the 
Chabauty—Coleman set at p = 5 and find an extra point T = (t, 1/2t* — 4) defined 
over Qiry/(t? — 24t — 48) (LMFDB 3.1.108.1). However, if we instead compute the 
Chabauty—Coleman set at the prime p = 17 we find X(Q17); = X(QUWU{T, S} 
where W is the pair of 17-adic ramification points. The point S is a different point 
S = (s, s?+6s+8) defined over a different number field Q[s]/(s? + 9s7-+24s +24) 
(LMFDB 3.1.216.1). 

In the Jacobian of X, we have that 18[T — co] = 9[S — oo]. Furthermore, relating 
this back to the known point of infinite order in J (Q), [(—3, —1)—0o], we can show 
that 18[T — co] = 3[(—3, —1) — 00]. So S and T appear in the Chabauty—Coleman 
set by linearity of the Coleman integral. 


Example 4.4 The curve X : y> = x4 + 25x39 — 78x? + 76x — 24 with discriminant 
411210307584 has Mordell—Weil rank | and possesses the infinite order point D € 
J(Q) that is a lift of div(x? — 6x + 4). 

The first prime that splits completely in Q[x]/(x? — 6x +4) is 11, so we compute 
X (Qi1)1. We obtain the rational points, one ramification point, as well as the point 
(2, 32!/3), whose image in J (Q) under the Abel—Jacobi map is 9-torsion. 


Example 4.5 In their Chabauty-Coleman experiments on genus 3 hyperelliptic 
curves with Mordell—Weil rank 1, Balakrishnan—Bianchi—Cantoral-Farfan— 
Ciperiani-Etropolski [BBCF*] find a novel reason for extra points in the 
Chabauty—Coleman set. They exhibit a curve with two extra points coming from 
the presence of extra endomorphisms of the Jacobian, which are not explained by 
linearity or torsion. Motivated by their example, we searched Sutherland’s Picard 
curve database for similar examples and found several. This first example of a curve 
with an extra point in the Chabauty—Coleman set explained by the automorphism 
group of the curve comes from searching outside of our rank | database. 

Let X : y> = x*—2 be the Picard curve with discriminant 10319560704000000. 
The Jacobian of this curve has rank 2 and has linearly independent non-torsion 
classes [(2, 3) — co] and [(—2, 3) — oo] in J(Q). 
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Table 1 Points in X(Q)3)1 


Set Points 

XQ) (2, 3), (—2, 3), 00 

W Four Galois conjugates of ((5/ 2)'/4 0) 

T Twelve Galois conjugates of ((45/2)!/4, 40!/3), Three Galois conjugates of 
(0, (—5)'/9) 

A Two Galois conjugates of ((—4)!/2, 3) 


The Q-automorphism group of X is a group of order 48 defined over the 
degree 12 number field L := Q[x]/(x!? — 6x!! + 21x!9 — 50x° + 90x8 — 
126x7? + 191x® — 276x° — 285x+ + 950x? — 354x? — 156x + 676) (LMFDB 
12.0.7652750400000000.2). The group is isomorphic to SL2(IF3) x, C2 where the 
image of p : Cp + Aut(SL2(F3)) is given by conjugation by an element of order 2. 
Let p = 13 be the first prime that splits completely in L. Computing the Chabauty— 
Coleman set at p, we find X(Q\3); = X(Q) UW UT UA where W is the set of 
ramification points, T is the set of points whose image in J (Q) is torsion, and A is 
a set of points that will be explained only by automorphisms of the curve, as shown 
in Table 1. 

To verify that the points in T give rise to torsion classes in the Jacobian, while 
those in A do not, we compute the Coleman integrals f i aj,i = 1,2,3 on regular 
differentials for each P in T and A using the new functionality for integrating to 
points defined over number fields. These integrals are 0 on points in T to our p- 
adic precision, but not zero on points in A. We then algebraically check that for 
P €T, the class n[P — co] = 0 € J(Q) for some n € Z using Jacobian arithmetic 
in Magma. The image of each point in T of the form ((45/2)!/4, 40!/3) under the 
Abel—Jacobi map is a 12-torsion point on the Jacobian, while those of the form 
(0, (—5)!/3) are 4-torsion points in J. 

On the other hand, we show that the classes Ay := [((—4)!/*,3) — co] and 
Az <= [(—(—4)!/2, 3) — 00] do not have Z-linear relations with P; := [(2, 3) — oo] 
and P2 := [(—2, 3) — oo] in J(Q). Suppose nA; = a P| + bP2 for somen,a,b € Z 
with n 4 0. Then nA} = aP; +b P2 where c is complex conjugation. But A = Az, 
and subtracting we would have n(A; — Az) = 0 in J(Q). However, computing the 
Coleman integrals to A; minus the Coleman integrals to Az we get a nonzero value, 
so A; — Az cannot be torsion. 

Therefore we need a new reason to explain the presence of the points in 
A in the Chabauty—Coleman set. This reason will come from examining the 
automorphisms of X. When the vanishing differential is an eigenvector for a 
nontrivial automorphism of the curve, there is potential for extra points to appear 
in the Chabauty—Coleman set. Recall that @) = ydx/f,@2 = xydx/f, and 
3 = y’dx/f. Note X has an order 4 automorphism given by y : x +> ix that 
acts by pullback on the differentials: g*w, = iw,, g*w2 = —w2, y*w3 = iw3. The 
set of vanishing differentials Van(X (Q)) has dimension 1; let v € Van(X (Q)) be a 
nonzero element. We will show v is an eigenvector for g* with eigenvalue i. 
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First we will explain this phenomenon by showing why the points of A are in 
the Chabauty—Coleman set, assuming v is an eigenvector for g* with eigenvalue 
i. It is enough to show that integrating A; against the vanishing differential v is 
zero. Note that choosing square roots, g((2, 3)) = ((—4)!/2, 3) and g((—2, 3)) = 
(—(—4)!/?, 3). Using change of variables and applying the fact that our vanishing 
differential is an eigenvector, we have 


g((2,3)) 
[=f v= f otv=if v=0 
A} y(oo) Pi P| 


and similarly for Az. Thus the points of A are in the Chabauty—Coleman set because 
of 9. 

Now we prove that v is an eigenvector for g*. Write v = A@, + Bw + Ca3, 
with A, B,C € Q,. The vanishing differential is defined (up to a scalar multiple) 
by the equation Sp, v = 0 for j = 1, 2. Acting by y? we have 


g° ((2,3)) 
o= | v= f v= | @rro= | —Aa, + Bw? — Ca3. 
Po 9? (co) Py Py 


Adding this to the fact that Sp, v= tp, Aq + Bar + Cw3 = 0 we see that 


2 | Bao = 0. 
P| 


We compute that the integral over P, of w2 is not zero, so B must be zero, and 
v = Aa + Ca; is indeed an eigenvector for g* with eigenvalue i. 


Example 4.6 In addition to the previous example, we found three examples of extra 
points coming from automorphisms of the curve that arose during the computation 
of the Chabauty—Coleman sets in our rank 1 database. We describe one here. Let X : 
y? = x442x7 + 6x7 +5x 42 be the Picard curve with discriminant 277826509467 
and Mordell—Weil rank 1. There are no small affine rational points on this curve, 
so We use an infinite order point D € J(Q) that is a lift of div(x? + x — 1), and 
we compute the Chabauty—Coleman set for X at p = 11, the first split prime in 
Q[x]/(x? + x — 1). We obtain: 


X(Qu)1 = {o0, (1/2, ¥13/16)}. 


Let R := (—1/2, ./13/16). To check if R is torsion, we compute the Coleman 
integrals on the regular basis differentials - . @;,i = 1,2,3, which are not all 
zero. The presence of R in X (Q),); also cannot be explained by relations in J(Q): 
suppose n[R — co] = mD for some n,m € Z \ {0}. Let f3 be a primitive third 
root of unity. Acting pointwise by o : (x, y) Fb (x, oy) on each divisor, we have 
nR° —noo® = mD°. Subtracting from the original relation, we would have n(R — 
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R°) = 0in J(Q), but a computation with Coleman integrals confirms that R — R® 
is not torsion. 

By computing the automorphism group of X over Q[x]/(16x* — 13), we find X 
has an order 2 automorphism gy sending x +> —x — 1 and fixing y. The pullback g* 
acts on Tuitman’s basis of regular differentials by g*w, = —a@, y*w2 = w2, and 
gy*w3 = —w3. By a similar method used in Example 4.5, comparing the Coleman 
integral over D against v, and the Coleman integral over D against g*v, we can 
show that the vanishing differentials are of the form v = Aw, + Cw3, and are 
eigenvectors for y* with eigenvalue —1. Furthermore, y(R) = R and g(co) = co 


and therefore 
R g(R) R R 
/ v= f v= f ov=-f v. 
i) g(co) 0° oo) 


So a v = 0, showing R € X(Q)4)1. 
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Appendix 1: Precision Heuristics 


Recall that, in the process of computing (non-tiny) Coleman integrals - @ where 


one endpoint Q is in a bad disk, we must break the integral up into : ot f bi o 
such that S is near the boundary of the disk. In other words, we find a point S defined 
over a totally ramified extension Q,,( eo °). Balakrishnan and Tuitman’s code takes 
as an input this e, but it can be a balancing act to decide what value to enter: too 
small and the integral will not converge, but the larger e is, the longer the runtime 
(Tables B.1 and B.2). 

This section gives tables of the smallest e needed for effective chabauty 
rounded up to the nearest multiple of five, fixing p and N, for 10 arbitrarily chosen 
curves of rank | with a rational point whose class in the Jacobian is of infinite 
order taken from Sutherland’s database. Entries “bad” denote that a curve has bad 
reduction at that prime. For the tables, after small primes p < 13 we chose a sample 
of primes less than 100 spaced roughly by 10. 

For every curve X in the full database of 1403 Picard curves, we also recorded the 
e value used to compute X (Q,); and to select a non-torsion divisor when N = 15. 
We initialized e at 40 and then incremented by 20 until the integrals to compute 
X (Q,)1 converged. These values can be found in the master list of Chabauty data on 
the 1403 Picard curves masteralldata.txt available in the repository [HMa]. 
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Table B.1 Table for p = 5,7, 11, 13, N = 10 


Discriminant : Curve 5 7 11 13 

31492800 : y> = x4 4+3x3 —3x41 bad (50 |85 | 100 
47258883 : ys = x4 42x93 —x? — x 40 | bad |85 | 100 
70858800 : y> = x44+3x3 —x?-—4x 42 bad |50 |85 | 100 


106288200 : y? = x4 + x3 — 66x? — 324x — 432 | bad |30 |50 | 55 
151322904: y3 = x4 + 11x — 32x74+28x—8 (30 |35 |50 | 100 


212891328 : y> = x44 5x3 4+ 4x? —S5x 41 10 |50 |85 | bad 
105303439827 : y3 = x44 4x3 4+ x7 -—3x-1 30 |50 |85 | 95 
108095630841 : y? = x4 + 4x3 + 6x? — 9x 35 |40 | bad | 55 
113232992256: y3 = x44 x3 — 4x? — 8x 30 |50 |80 | 100 
988868881152 : = 4452 9x43 10 |55 |85 | 100 


Table B.2 Table for select p > 23, N = 10 


Curve 23 31 53 61 71 83 97 
yp =xt43x3- 3x41 180 | 245 420 |485 |565 | 660 | 770 
ypaxt+2x7-3x?-x 180 | 245 420 |485 |565 | 660 | 770 
yi = x44 3x3 — x? —4e 42 180 | 245 420 | 485 |565 | 660 | 770 
yi = x44 x3 — 66x? — 324x — 432 195 | 105 200 | 205 | 240 | 280 | 325 
yi = x44 11x3 — 32x74 28x —8 | 180 | bad 420 |485 |565 | 660 | 770 
ae 5x3 4+ 4x? — 5x41 180 | 240 420 | 485 |565 | 660 | 770 
y= 4x3 +x? —3x-1 180 | 245 420 | 485 |565 | 660 | 770 
yes 4x3 + 6x? — 9x 95 | 120 200 | 230 |270 |315 | 365 
yi =xt4 x3 — 4x? — 8x 180 | 245 bad | 485 |565 | 660 | 770 
yi axt4x2—2x43 180 | 245 420 | 485 |565 | 660 | 770 


Appendix 2: Iterative Hensel Lifting 


In this section we give an algorithm for computing approximate roots in Z, of a 
polynomial with coefficients in Z,,. Concretely, the elements of Z, are represented 
by integers in [0, p'’ — 1] for some positive integer N, the coefficient precision (here, 
[a, b] is the set of all integers k such thata < k < b). Let f € Z[x] be a polynomial 
whose coefficients are in [0, pe — 1]. We compute elements r € [0, p™ — 1] such 
that 


(1) f(r) =0 (mod p™); 
(2) there exists an integer 0 < k, < N such that r € [0, p* — 1] and f(r + p*’s) 
is ideaucally. zero in Z/p™ [s]; 
(3) ifr’ € [0, p&’-! — 1] an r’ =r (mod p*—!) then f(r’ + p*—!s) is not 
identically zero in Z/p™ [s]; and 
(4) if eee (r)) < N then a exists F € Zp, such that f(7) = 0 and |r —r| < 
‘- (in other words, the approximation of 7 by r is correct to precision k,). 
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These properties follow in directly from Algorithm B.0.1, except (4) which is an 
application of Hensel’s lemma (see Definition 3.5). 

From these properties, we see the algorithm computes a system of roots as 
described in Sect.3.2, with the additional property (2) which keeps track of the 
precision of the roots. To compute a system of roots, we inductively lift roots of 
df mod p' to roots mod p‘t! in what is essentially a depth-first-search through the 
elements of Z/p’. 


Algorithm B.0.1 Solve for a system of roots R of f € Z/pNZ{x]. 
Input: Prime p, precision N, and f € Z/pNZ{[x] such that f mod p is not 
identically zero. 


Output: A set R of roots in Z/pNZ such that if r,r’ € R, then r # Yr’ 
mod p¥—»S"), 


(1) Let A = []. 

(2) LetB={aeF,: f(a)=0 mod p} be the F p-roots of f. 
(3) Update A = [(b,1):b € B]. 

(4) Let Z = []. 

(5) While A is nonempty: 


(a) Let (a, 1) be the first element of A. Remove (a, i) from A. 
(b) Let go(s) = f(a + p's). 
(c) If go(s) #0 (mod p™): 


(i) Let v be the minimum valuation of the coefficients of go and set g := 


80/P”. 
(ii) Let B be the roots of g in F y and prepend [(b,i+1):b€ B]to A. 


(d) Else: 
(i) Append (a, i) to Z. 
(6) Return Z. 
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Linear Dependence Among Hecke M®) 
Eigenvalues spooks 


Dohyeong Kim 


Abstract We prove an explicit upper bound on the absolute value of the coefficients 
of a non-trivial integral linear relation among Hecke eigenvalues of a given cuspidal 
eigenform. Our motivation lies in its algorithmic application. For any fixed positive 
integer n, the bound established here yields an algorithm that computes cuspidal 
Hecke eigenforms with a given weight k whose Hecke eigenvalues generate a 
number field of degree n. The resulting algorithm reduces to Cremona’s when n = 1 
and k = 2. 


1 Introduction 


Hecke showed that, if 


CO 


f(M= amq’", am € C 


m=1 


is the Fourier expansion of a primitive normalized cuspidal modular form of weight 
k > 2, then its coefficients satisfy 


dm = O(m?) 


where the implied constant only depends on the form. Deligne, using his deep work 
confirming the Weil conjecture, lowered the exponent by one half. Furthermore, the 
implied constant was made absolute. More precisely, he showed that the inequality 


k-1 


lam| < d(m)m 2 
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holds true for any m > 1, where d(m) denotes the number of positive divisors of m. 
See Ch. 9 of [2] for a treatment of the topic. 

On the other hand, the Hecke field of f(q), the subfield of C generated by all a,,’s 
is of finite degree, say n, over Q. Then, Deligne’s bound allows one to bound the 
coefficients of the minimal polynomial of a, in terms of k, m, and n. This cuts 
out a finite collection of polynomials that can be potentially satisfied by the m-th 
eigenvalue a, of some f(q). 

Our aim in this article is to construct a similar collection of linear relations among 
multiple a,,’s. Concretely, for a given choice (m1, m2,...,™n+1) of n + | distinct 
indices, we want to produce positive integers b;, b2, ..., by +1 which shall guarantee 
a non-trivial relation 


n+l 
\Aiam; =0, A; €Z (1.1) 
i=l 


such that |A;| < b; for all i. More generally, we will consider the possibility of a 
subcollection dm,,...,4m,,, for some r < n admitting a non-trivial relation among 
themselves. This motivates us to consider the relations of the form 


r+l1 
Yo Aiam; =0, a; €Z (1.2) 
i=1 

where Gm), -.-,m,,, generate a Z-module of rank exactly r. Of course, if there is 


a non-trivial relation of the form (1.1), then there exists some r < n for which (1.2) 
holds true after reordering indices. Therefore, it suffices to consider relations of the 
form (1.2) for each r = 0,1, 2,...,” instead of (1.1), maintaining throughout the 
assumption that dm,,...,4m,,, generate a Z-module of rank r. This assumption 
will be exploited repeatedly in subsequent arguments. 

We first introduce a result which applies to all r. For integers k > 1 andm > 1, 
let 


bm.k = d(m)ym > (1.3) 


be the Deligne bound. The inequality (1.5) below is what we call the trivial bound. 


Theorem 1.4 Let f(g) = dons 4nq™ be a normalized cuspidal Hecke eigenform 
of weightk > 1. Ifm,..., mp4 are chosen so that Gm,,...,4m,4, Span a Z- 
module of rank r, then there is a non-trivial relation wae, Aidm, = 0 with dr; € Z 
such that 


Nie 


els” | [| Pmt | | >, Pony tk (1.5) 
Axi LAI : 
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where the product and the sum are taken over | < ji, j2 <r+1and ji, jo Zi. 


Note that the trivial bound does not involve the degree of the Hecke field. 

We move on to our main result whose aim is to refine the trivial bound. Note that 
if x (m) = 1, where x is the character of f(q), then the corresponding eigenvalue a), 
is totally real. Therefore, such eigenvalues will belong to the maximal totally real 
subfield F of the Hecke field. From now on, we will denote the degree of F by n, 
and the main result applies to the case r = n. To formulate it, we will need two 
positive quantities n(n) and 0(n). We require that 7() is a lower bound for the root 
discriminant of a totally real number field of degree n. Explicit values of n(n) have 
been obtained by Odlyzko [9]. For 6(n), we require that any lattice of dimension n 
inside R” admits a basis whose orthogonality defect is at most @(n). Various forms 
of @(n) are available [7, 10]. Asymptotically, n(n) = M — o(n~7/3) with M = 
60.8395... is due to [9], and log(@(n)) = O(nlogn) due to [7]. In any case, we 
define 


O(n) 


p(ay= nny 


to be the ratio. Note that p(n) is independent of k. We list some of its values in 
Table 1. 
Here is our main result. 


Theorem 1.6 Let f(qg) = >) ,51; 4nq™ be a normalized cuspidal Hecke eigenform 
with character x and weight k > 1. Let n be the degree of the maximal totally real 
subfield of the Hecke field of f (q). If m1, ...,mn+1 are chosen so that x(m;) = 1 
for all i and that am,,...,Qny,,, generate a Z-module of rank n, then there is a 


non-trivial relation eae idm, = 0 with dj € Z such that 


1 


2 


il <n"? p(n) | do Ont (1.7) 
J#i 


where the sum is taken over 1 < j <n+1andj <i. 


Since the right hand side of (1.7) depends on the choice of m1, m2,...,™n+1, 
one is naturally interested in making a good choice. We largely ignore the problem 
to make the best possible choice, and contend ourselves with the fact that some 
choice can be made without recourse to the form f(q). For example, given k and x, 
one may choose first n values of m;’s with x (m;) = 1. 

We compare the trivial bound (1.5) and the main theorem (1.7). Since the last 
factor is common to both, we may compare r’ |] j, Pm jy .k and n"!? p (n) whenr =n. 
The latter is, first of all, independent of the weight k of the form in consideration. 
Therefore, (1.7) is an improvement of (1.5) in the weight aspect. On the other hand, 
we observe that p(n) decreases at least up ton = 8. We give some approximate 
values of p(n) in Table 1. Asymptotically, as we noted earlier, we know from [7] 
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Table 1 Some approximate values of p(n) 


n 2 3 4 5 6 7 8 
p(n) 0.52 0.21 0.078 0.030 0.013 0.0064 0.0039 


that p(n), in logarithm, belongs to O(n logn). Therefore, (1.7) is not worse than 
(1.5) as n grows to infinity. 

Refinements of the trivial bound are relevant in the following computational 
problem. Let n and k be fixed positive integers. If V is a vector space of modular 
forms of weight k on which a set of Hecke operators act, then the problem asks 
one to determine whether V has an irreducible subquotient of dimension n. It was 
studied by Cremona [5, 6] in the case n = | and k = 2, giving rise to an influential 
database of elliptic curves. We will explain in Sect. 7 how our work generalizes a 
part of it. 

The organization of the paper is as follows. In Sect. 2, we will prove Lemma 2.2, 
a key technical tool, about linear relation among algebraic integers. Then, we will 
continue to review bounds for discriminant in Sect. 3 and orthogonality defects in 
Sect.4, both of which we employ to bound p(7). In Sect.5, we apply Deligne’s 
bound, and note a connection to Kronecker’s theorem. After reviewing these results, 
the trivial bound and the main theorem will follow by straightforward bootstrapping 
in Sect.6. In Sect.7, we conclude with a description of our algorithm and a 
comparison to Cremona’s work. 
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2 Linear Relations Among Algebraic Integers in a Given 
Number Field 


Suppose that F is a number field of degree n with its ring of integers Or. 


Consider n + 1 elements a1,...,@n+1 € Of which generate a Z-submodule of 
rank n. 
Using a;’s, form an additive map 
A: Z*! _, Op 
which sends the i-th standard basis to a;, fori = 1,2,...,n+ 1. 


Choose an identification Or = Z” using a Z-basis of Of. Then, A is represented 
by ann x (n+ 1) matrix A. Denote by A; the submatrix of A obtained by removing 
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the i-th column fori = 1,2,...,2+ 1. It is a square matrix whose determinant 1S 
independent of the basis we have chosen for Of. Put Aj = (—1)'det(A;) and A = 
(A1, 8 Lg) An+1)- 


Lemma 2.1 We have 4.4 0 and A(A) = 0. 


Proof The first assertion follows from the assumption that a1, ..., @n41 span a Z- 
module of rank n. The second assertion follows from Cramer’s rule. oO 


Our aim in this section is to bound |A;|’s. Foreachi = 1,...,-+ 1, let L; be the 
submodule of Of generated by the columns of A;. A basic property of determinants 
tells us that |A;| = [Or : L;], where we use the convention that [Or : L;] = 0 
when L; is of infinite index in Or. We conclude that the problem to bound A;’s 
reduce to bounding [Of : L;]. We describe below an elementary method for it. 

Without loss of generality, we may assume i = n+ 1 and Ly4; has rank n. 
Put L = L,+1 for simplicity of notation. We would like to bound [Or : L] in terms 
of o(a;) foro ¢ Ip, the set of archimedean places of F. Let 


M:F>R 


be the function which assigns to each a € F the average of |o(a)|? taken over o € 
Ir. In fact, M(q) is absolute in that it does not depend on the choice of a number 
field that contains w. For example, M(1 + V2) = AG + 2/2) + 53 — 2/2) =3 
and M(1 + ./—3) = 4. We also define 


ua) = (M(a))'/. 


Lemma 2.2 We have 


1 


2 


[Or : L]< oa (I Ha) {0 n@j) 
i=} j=! 


for any Z-basis B,,..., Bn of OF. 


Proof Lett: F — Qbe the trace map. We rewrite 


__ (det (c(Bjar))) | 


Or: L|= 
WOE EA eee (B NI 


whose denominator is |D | by definition. Thus, it suffices to prove the inequality 


1 
2 


|det (t (Braj) | <n” (Fe). Yue] . (2.3) 
i=l j=l 
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By the Hadamard inequality, the magnitude of the determinant of a matrix is at most 
the product of the Euclidean lengths of its columns. On the other hand, |t(@B)| < 
nj(a)u(B) for any a, B € F by the Cauchy-Schwartz inequality. Applying these 
two, one obtains 


1/2 
n n / 


Idet (r(Biaj)) |<] [| > ties)? 


i=1 \ sel 


= 
= 


<T] [2 @e@u@))? 


i=1. \ysl 


n n 2 
<m"( uaa), Yo w@jy 


as desired. Oo 
We rewrite the inequality in Lemma 2.2 in the form 


1 


2 


nt/2 nr/2 n n . 
(Or: L1< ap peel lee: Yona]. (2.4) 
i=1 j=1 
Since this holds for any choice of (1, ..., By), it becomes 

1 

nn? nz 2 n ; : 
[Or :L1< pe | int, oe Teo) | m@y?} . 25) 

a " i=] j=l 


where the infimum is taken over the set of Z-bases of Or. The problem to 
bound [Of : L] is reduced to three subproblems corresponding to each factor. 
First, we need a lower bound for |Dr|. Second, we need to find an upper bound 
for the factor in the middle. Lastly, we need an upper bound for each w(a;). In 
subsequent sections, we will discuss bounds for these factors. Note that only the last 
part depends on the lattice L, while the first two are determined by Or. After that, 
in Sect. 6, we summarize the results and prove Theorems 1.4 and 1.6. 


3 Discriminants and Odlyzko Bound 


We review the work of Odlyzko on lower bounds for discriminants of number fields. 
Let F be a number field of degree n. Let Dr be the discriminant of F, and dp := 
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Table 2 Root discriminant bounds, totally real fields 
n 2 3 4 5 6 7 8 9 


n(n) 2.223 | 3.610 | 5.067 | 6523 | 7.941 | 9.301 | 10.596 | 11.823 
n*(n) | 2.225 | 3.630 | 5.124 | 6640 | 8.143 | 9.611 | 11.036 | 12.410 


|Dr|!/” be its root discriminant. Our main concern is to bound 6 when F is totally 
real. 
By a root discriminant bound, we shall mean an inequality of the form 


dp = n(n) 
where (7) is a function of n. Table 2 contains some of the root discriminant 
bounds, extracted from Table 4 of [9], where (7) denotes an unconditional bound 


while 7*(n) is conditional on a generalized Riemann hypothesis. 
This allows us to bound the first factor in (2.5). 


4 Lattice Reduction and Orthogonality Defect 


Let A Cc R” be a lattice of rank n. Let v(A) be the covolume of A. For a basis x = 


(x1,...,X,) of A, define the orthogonality defect of it to be 
n . 
od(x) = ies lleill, 
v(A) 
where || - || denotes the Euclidean norm. The Hadamard inequality tells us that the 


orthogonality defect is at least one, and is equal to one if and only if the basis is 
orthogonal. Thus, it provides a way to measure the extent to which a basis fails to 
be orthogonal. It is natural to consider 


od(A) := infod(x), 


where the infimum is taken over all bases of A. We will call it the orthogonality 
defect of the lattice A. 

We explain the relevance of the orthogonality defect in (2.5). We view OF as 
a lattice in F @g R, which we identify with the Euclidean space R” via field 
embeddings F > R. 


Proposition 4.1 The infimum 
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taken over all Z-bases (B, ..., Bn) is equal to od( OF). 


Proof It suffices to note that v(Or)* = |Dr| and that ./n - (B) is the Euclidean 
length of 6 for B € Of. oO 


Now we explain how to bound orthogonality defect of a lattice A C R”. An 
important quantity associated to A is the length of shortest non-zero vector in it, 
often denoted by 4; (A). The Hermite constant y, is defined to be 


Yn = sup Aj(A), 
A 


where the supremum is taken over the set of all lattices with unit covolume. 
Clearly, y; = 1, and other known values of y, range over 2 <n < 8 andn = 24. 
On the other hand, we have an upper bound 


2 n\2/n 
Yn < “7 (24+ =) 
14 2 


due to Blichfeldt [1]. Better upper bounds for y, were established in [3], and 
computed for n < 36 using the relation between lattice packing density and Hermite 
constants [4, p. 20] (Table 3). 

Various reduction algorithms provide upper bounds for od(A) in the form 


od(A) < A(n) 


where @(n) only depends on the lattice dimension n. Whenever such O(n) is 
established, we can apply it to the particular lattice A = Of associated to the 
number field and deduce 


n 
n" /2 


i. 8 I] L(Bj) = od(Or) < O(n) (4.2) 
" i=1 


We will recall those arising from the Minkowski reduction and the Korkin-Zolotarev 
reduction. The Minkowski reduction implies [10] 
a ifn <4, 


n 
od(A) < (n—3)(n—4)/4 (4.3) 
yn”? (3) ifn > 4, 


The Lagarias-Lenstra-Schnorr bound established in [7], which is based on the 
Korkin-Zolotarev reduction, is 


Table 3. Some known values of powers of Hermite constants 


cn 2 3 4 5 6 7 8 | 24 
yn : 2 4 8 et 64 as 44 
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Table 4 Orthogonality defect bounds 
n 2 3 4 5- 6 7 8 12 16 24 


a(n) 11.155 11.415 |2 | 3.163 | 6.455 | 15.63 | 48.83 |1.4x 104 |1.6x 10’ |4.4x 103 


n 


F 1/2 
oa(a) = x6 T] (=) (4.4) 
i=1 


Lastly, Nguyen and Vallée [8, p. 47] obtained 


4 n(n—1)/4 
od(A) < (5) (4.5) 


based on the Hermite reduction. 

For n = 1, 2, (4.3) and (4.5) agree and they are better than (4.4). For 3 < n < 10, 
(4.3) is the best. For 11 <n < 14, (4.5) is the best. For n > 15, (4.4) is the best. 
Table 4 summarizes in numbers the best among these three methods for 2 < n < 8, 
and n = 12, 16, 24. 


5 Kronecker’s Theorem 


We come to the last factor in (2.5). Let a be an algebraic integer in a number field F, 
and M a positive real number such that |o(@)| < M for any embedding 0: F > C. 
For example, when a = a» for some cuspidal eigenform f(q) = )-,,+; 4mq", M 
may be taken to be the corresponding Deligne bound. The trivial bound for (a) is 


(a) < M. (5.1) 


For the proof of Theorem 1.6, all we will need is (5.1), but we would like to note in 
this section that we are often able to refine (5.1). 

Let @ be an irrational totally real algebraic integer. Kronecker proved that the 
conjugates of a are contained in the interval [—2, 2] if and only if @ can be written 
asa = ¢ + ¢~! where ¢ is a root of unity. 


Proposition 5.2 Let a be a totally real algebraic integer of degree n such that a is 
irrational. Choose a positive real number M with |o(a)| < M for any embedding 
of F into R. Also assume that M* — t > 4. Then 


1/2 
wa) < (2 -=) : (5.3) 


n 
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Proof If a is of the form a = ¢ + ¢7! for a root of unity ¢, then w(a) < 2 
holds trivially and the claimed inequality follows from the assumption M* — 4 

4, Otherwise, Kronecker’s theorem forces a conjugate of a” to be strictly smaller 
than M? — 4. This yields the claimed inequality. oO 


Returning to (2.5), let us temporarily assume that the field F has an odd degree. 
Since F cannot have any quadratic subfield, it follows that an irrational element a € 
F never satisfies a? € Q. This allows us to use (5.3) instead of (5.1). 


6 The Proof of the Main Theorem and the Trivial Bound 


The proof of the main theorem requires little more than adding up results 


from previous sections. Let dm,,...,4m,,,; be Hecke eigenvalues of f(q) 
where m,...,™n+41 are appropriately chosen so that each a, is totally real. 
Apply Lemma 2.1 to aj = dm,,i = 1,2,...,n +1, and obtain a non-trivial 
relation > Aja; = O. Each |A;| is the index [Or : L;], where F is the 
totally real subfield of the Hecke field of f(g) and L; is the lattice generated 
by @1,...,Qj—1, Mi41,-.-,@n+41. AS explained below Lemma 2.1, we have the 


equality |A;| = [Or : L;]. Note that when L; has rank less than that of Or both 
sides of the equality are regarded as zero. Then, we apply the inequality (2.5) and 
apply the bounds from Sects. 3 and 4 in order. For the sum ik [L(a;), we apply 
trivial bound (5.1) together with the Deligne bound. This completes the proof of 
Theorem 1.6. 

We continue to prove Theorem 1.4, the trivial bound. Suppose that the Hecke 
eigenvalues dy,,..-,4m,,, generate a lattice L of rank r > 0. For simplicity of 
notation, put a; = dj,,. We will apply a method similar to what we used in Sect. 2. 
The argument for Lemma 2.1 and the following explanation implies that there is a 
non-trivial relation yas Aja; = 0 where A; is, up to sign, the index in L of the Z- 
submodule generated by a1, ..., @j—1, @j41,-.-,@-+1. Without loss of generality, 
we assume i = r + 1. For a given basis 61, ..., 6 for L, we have 


det(t (a; B;)) 


ci Sah La i,j =1,2,...,r. 
det(t (A; Bj) 


lAr+1| — 


Because both t (a;q;) and t (a; 6;) are integral and |det(t (@; B;))| < |det(t (aja; ))|, 
the above has a rather trivial upper bound |det(t(a@j;a;)|. Then, the Hadamard 
inequality together with Deligne’s bound implies 


|det(t (ajar; ))| < |det(rpm;,k Pm;,k)| 
1 


a r 2 
r 2 
<r I] Pm;,k Ping k | + 
j=l 


i=1 
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This completes the proof of the trivial bound. 


7 Application: An Algorithm for Modular Forms 


A typical computational problem in the theory of modular forms is to decompose a 
module V over a Hecke algebra into irreducible subquotients. For example, V can 
be a space of modular forms stable under a set of Hecke operators. We will focus 
on the case when V is cuspidal, defined over Q, and the Hecke algebra is generated 
over Q by a suitable set of commuting Hecke operators. 

We describe a general and elementary method for attacking the problem. If V and 
some Hecke operators on it are given, one may start by choosing a Hecke operator T 
and computing its characteristic polynomial, say Rr(X). If Rr(X) is irreducible, 
then V must be irreducible and the decomposition is complete. If not, there must be 
a non-trivial factor of R7(X), say Rr(X ), and the kernel and the image of Rr(T) 
are non-trivial subquotients of V, stable under other Hecke operators as well. One 
repeats this until all Hecke modules are irreducible. 

Often, all one wants to know about V is whether it has a subquotient of particular 
type. For example, one might wonder if V has an eigenform with trivial character, 
weight two, and rational Hecke eigenvalues. In this case, it suffices to compute the 
kernels of JT, — 4 with A = —2, —1,0, 1, 2. An attentive reader may wonder how 
this is possible, given the fact that there do exist eigenforms with irrational Hecke 
fields whose T>-eigenvalues happen to be rational. Let us temporarily call them fake 
rational eigenforms. With the presence of fake rational eigenforms, one would have 
a non-zero kernel of T) — A, say V,, which does not contain any rational eigenform. 
However, V; is almost always much smaller in dimension than V, and it is much 
less demanding to handle V,. This is what Cremona employed in order to generate 
his database [5, 6] without fully decomposing relevant spaces of modular forms. 

Suppose that one is instead interested in the type of irreducible subquotients of V. 
If W is an irreducible subquotient, then the Hecke algebra H(W) acting on W itself 
is a number field! and its degree is equal to the dimension of W. Let n(W) be the 
degree of the maximal totally real subfield in H(W), and we shall call n(W) the 
essential degree of W. We consider the problem to find all irreducible subquotients 
of V with essential degree n. Since n(W) is at most the dimension of H(W), it 
subsumes the problem to find all irreducible subquotients of dimension n. A natural 
question is, especially when V has such a large dimension that the characteristic 
polynomials of Hecke operators are difficult to compute and factor, whether one 
can obtain all the desired subquotients without fully decomposing V. It is indeed 
possible, as we explained in the introduction, that one can use Deligne’s bound 
to produce a finite family of polynomials with integer coefficients in a Hecke 
operator, which plays the role of {7, — A: |A| < 2} in Cremona’s strategy. Our 


‘Indeed, it is isomorphic to the Hecke field of any eigenform in W @g C. 
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results provide an alternative answer to this question. If W is such a subquotient, 
then there must be some non-trivial linear relation among the restrictions to W 
Of Tiny, ++ +s Tiny, Where m1,...,™n+1 are chosen so that each T,,, has totally 
real eigenvalues. Reordering indices, for some 0 < r < n, cee oc Trace | isa 
minimal set admitting a non-trivial linear relation. Then, 7), := Eee AiTm, would 
annihilate? W for some vector A = (A1,..-,Ar41) € O with integer entries. If r = 0, 
then some 7,,, annihilates W. If 0 < r < n, then the trivial bound (1.5) applies. 
Ifr =n, then (1.7) applies. We note that ifn = 1 and the space V consists of forms 
with trivial character, then our method reduces to Cremona’s. 

We illustrate the algorithm by reporting two numerical examples which we ran 
in SageMath. Using Brandt matrices, we computed the Hecke operators T> and 73 
acting on the Brandt module V of level N = 13,499. The module V is defined 
as a vector space over Q and its dimension is 1126. Elements in V correspond to 
modular forms of weight two. The problem is to determine whether V has any two- 
dimensional irreducible subquotients. The finite set of linear combinations of 1, 7>, 
and 73, as constrained by Theorems 1.4 and 1.6, turn out to have trivial kernels. 
This shows that V has no irreducible subquotients of dimension two. For the second 
example, we take N = 12,553, whose associated Brandt module has dimension 
1046. Then, | + 72 — 73 turns out to be the unique linear combination with a non- 
trivial kernel. Its dimension is two, on which the action of 73 is irreducible. We 
conclude that the Brandt module of level 12553 has a unique irreducible subquotient 
of dimension two. 
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Congruent Number Triangles with the M®) 
Same Hypotenuse sei 


David Lowry-Duda, with an appendix by Brendan Hassett 


Abstract In this article, we discuss whether a single congruent number f can 
have two (or more) distinct corresponding triangles with the same hypotenuse. We 
describe and carry out computational experimentation providing evidence that this 
does not occur. 


1 Introduction and Motivation 


A congruent number is a rational number that appears as the area of a rational 
right triangle. The congruent number classification problem is the problem of 
determining whether a given number f is congruent. By scaling the triangle, one 
may consider only the classification of squarefree integers t; we do this throughout 
the article. The study of congruent numbers is ancient and has a long history; see 
the survey article of Conrad [Con08] for more. 

We consider whether a single congruent number ¢ can have two distinct rational 
right triangles with the same hypotenuse. This question was first motivated by an 
observation in [HKLDW 18]. 

In that paper, Hulse, Kuan, Walker, and the author consider a convolution sum 
whose main term exists only when f is congruent. Let t(m) denote the square 
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indicator function, taking 1 if n is a square and 0 otherwise. The primary theorem 
of [HKLDW 18] is that 


SY Se cam +n)t(m)t(m = n)t (nt) = CVX + O,((log XJ"), (1.1) 


n<X m<X 


where r is the rank of the elliptic curve E, : Y? = X* — t?X and 


a=) > (1.2) 


heH(t) 


is the (convergent) sum over the set of hypotenuses H(t) of dissimilar primitive 
right integer triangles with area equal to tu? (i.e. the squarefree part of the area is f). 

Although there is implicit dependence on ¢ in the error term and poorly 
understood dependence on ¢ in the main term, heuristically one should expect that 
for large X, the main term “quickly” dominates. 

In [HKLDW18], it is also shown that the set of hypotenuses H(t) is only 
logarithmically dense in Z;,. Thus heuristically the coefficient C, of the main 
term is well-estimated by its first term m/h, where h is the smallest hypotenuse 
in H(t) and m is the multiplicity of this hypotenuse. Further, the main term should 
heuristically grow larger than the error term quickly once X > h/m. 

The question of the size of the smallest hypotenuse in #/(f) is closely related to 
the least height of a non-torsion point on E; (we make this more precise in Sect. 3), 
but the multiplicity m is more mysterious. 

In this article, we describe numerical experimentation suggesting that the 
multiplicity is always exactly 1. We also propose that this is always the case as a 
conjecture. 


Conjecture There do not exist two dissimilar primitive right triangles with the 
same hypotenuse and whose squarefree parts of the areas are equal to each other. 


In Sect. 2, we describe why this problem might be hard to fully resolve. Hassett 
describes this more completely in the Appendix. 

We then describe and carry out numerical experimentation using the free and 
open source math software SageMath [Sag20]. Our experimentation comes in two 
forms. Firstly, we do a deep investigation for those congruent numbers below 1000. 
For each congruent number, we generate many different triangles by generating 
rational points E,(Q) and the corresponding right triangles. 

Secondly, we do a broad investigation for several congruent numbers correspond- 
ing to curves FE; of high rank. These curves will have more rational points up to 
a bounding height, heuristically leading to more triangles with hypotenuses up to 
some bound. In this investigation, we make heavy use of [WDE* 14], which gives 
several congruent number curves EF; of rank 6 and 7. In total, we investigate 1513 
curves of ranks 6 and 7. 

We describe these approaches and the results in Sects. 3 and 4, respectively. 
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2 Algebraic Formulation 


One can parametrize right triangles with two variables (s, ft) such that the sides 
are given by (s? — t7, 2st, s* +t”). We follow the convention that the hypotenuse 
is written last in any triple (a,b,c) giving a right triangle, and that s > f. In 
this correspondence, primitive integer right triangles correspond to relatively prime 
integers s and ¢. 

Thus finding two right triangles with the same hypotenuse and whose areas are 
the same up to multiplication by squares can be reformulated as finding integers 
(s, t, S, T) such that 


Peres 47? 
and where 


st(s?— 12), 
——— 18 a Square. 
sT(s2—-TH 4 


This last equation can be rewritten as 
Pes =P) Hv sts aT 


for some positive integers u, v. 
These two equations define a surface 


3 1 
XC Pic t.s.71 X Play 


If we could understand all points on this surface, we could almost certainly resolve 
the conjecture. Unfortunately, this understanding appears to be beyond current 
algebraic techniques. This is considered by Hassett in the Appendix, where it is 
shown that the surface X admits a resolution of singularities that is of general type 
and simply connected (see Proposition A). 

Thus we expect that a complete resolution of the conjecture is not within reach. 


3 Description of Methodology 


With a purely theoretical resolution likely out of reach, we turn now to com- 
putational experimentation. There is a well-known correspondence between right 
triangles (a, b, c) with ab/2 = t (which we may assume without loss of generality is 
a squarefree integer) and Q-rational points on the elliptic curve E, : ¥Y* = X?—1?X 
where Y ¥ 0. The inverse maps of this correspondence are given by 
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tb —-2r? 
,——) = (X,Y), 
CcC—-a C—-a 
X27? 2x xX?47 
Y = oY: * Y 


(a,b,c) > ( 


(X,Y) ( ) =@b,0. 
(See [Kob93] for a historical overview and description of the relationship between 
congruent numbers and elliptic curves.) 


3.1 Enumeration Through Elliptic Curves 


This provides an explicit and computable way of enumerating all rational right trian- 
gles with area r: find generators for E;(Q) and enumerate their linear combinations. 
This method of enumeration is of course vastly superior to naive enumeration and 
is at the core of the organization of the numerical experimentation. 

In principle, it is necessary to also include the torsion subgroup of E;(Q). 
However for these curves, the torsion subgroup is completely understood and can 
be ignored. In particular, the torsion subgroup is isomorphic to Z/2Z x Z/2Z, 
generated by two points 7) and 7. If the triangle (a, b, c) corresponds to the point 
P é€ E,(Q,, the eight equivalent triangles (-Ea, +b, +c) correspond to the points 
+P + €,T, + €27>, where e; € {0, 1}. Thus we omit no triangles (up to similarity) 
by omitting consideration of torsion points. 


3.2 Hypotenuses, and Heights, Grow Exponentially 


Given an elliptic curve with known rank and generators, the primary obstruction to 
collecting numerical evidence is the size of the triangles and hypotenuses. There is 
a general theory that the heights of points on elliptic curves grow very rapidly under 
repeated addition. In Corollary | of [HKLDW 18], it is shown that 


I{h € H(t) :h < X/2t}| = O,((log X)"””), 


where r is the rank of the corresponding elliptic curve E;. In particular, letting H(P) 
denote the naive height of the X-coordinate of P, it is shown that the hypotenuse 
h corresponding to a point P on E, satisfies h > H(P)/2t, and thus hypotenuses 
grow at least as quickly as the heights. 


Example 1 The elliptic curve Eg : Y* = X? — 36X has rank | and the free part 
of E¢(Q) is generated by g = (—3, 9). This point corresponds to the primitive right 
triangle (3, 4, 5). The hypotenuse is 3 bits long. The point 2 * g = (25/4, —35/8) 
corresponds to the primitive right triangle (49, 1200, 1201), whose hypotenuse is 
11 bits long. The point 4 « g has hypotenuse 2094350404801, which is 29 bits long. 
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The point 400 * g (the largest multiple we considered on this curve) corresponds to 
a right triangle whose hypotenuse requires 410426 bits to store. 


As heights grow exponentially, we see that hypotenuses grow exponentially. 
If we assume that the hypotenuses behave like exponentially growing random 
variables, then heuristically we would expect that it is very unlikely that two points 
of large height correspond to the same hypotenuse. Informally, we should expect 
that if a pair of points yields two triangles with hypotenuse that are close in size, 
then both points are probably of small height. 

For curves of rank 1, it is impractical to compute more than a few hundred 
triangles—they simply grow too quickly. For curves of higher rank, however, it is 
possible to compute millions of triangles before the individual sides of the triangles 
become prohibitively large. 


3.3. Two Strategies of Experimentation 


We investigate two potential sources for a counterexample to the conjecture: when 
all numbers are small and coincidental collisions are most likely, and when there 
are many triangles up to a certain size due to high rank. Thus we organize our 
experimentation into two strategies: one for elliptic curves E; with small t, and one 
for high rank elliptic curves. 

First, we study the curves EF; for all squarefree t < 1000. Each of these curves 
has rank 0, 1, or 2. In total, this includes 361 curves corresponding to congruent 
numbers. These are the curves with the smallest t and with the simplest triangles. 

As these curves have low rank, we can carry out “deep” investigation on each 
curve. More specifically, if {g;} denote generators for the free part of E;(Q), then 
we can study triangles coming from )° a; g; for all coefficients {a;} in a large box 
|a;| < K for aconstant K = K(f). For rank | curves, we choose K (t) > 300, and 
for rank 2 curves we choose K (t) > 75. As the points P and — P generate the same 
triangle, this translates to considering at least 300 triangles from each rank | curve 
and at least (1517 — 1)/2 = 11,400 triangles from each rank 2 curve. 

Second, we study curves E; of known high rank. To get these curves, we use 
the large-scale project detailed in [WDE™*14], which builds off of earlier work 
of Rogers [Rog00, Rog04]. In this project, Watkins et al. investigated ranks of 
congruent number curves. They identify 1486 curves E; of rank at least 6 and 27 
curves of rank 7, giving 1513 curves in total of high rank. We examine rational 
points on each of these 1513 curves. 

We note that no curves of higher rank are known, and it is conjectured that 
ranks in the family of congruent number curves are bounded [Hon61]. In §11 of 
[WDE* 14], it is further demonstrated that a heuristic of Granville’s suggests that 
the maximum rank of a congruent number elliptic curve is 7. 

As noted in Sect. 3.2, we should heuristically expect that it is more likely for 
points of small height to give two triangles with the same hypotenuse. For this 
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reason, we carry out a “wide but shallow” investigation of these curves. Again letting 
{gi} denote a set of generators for the free part of E;(Q), we study triangles coming 
from points }° a;g;, where all coefficients {a;} are in the small box |a;| < 4. This 
gives 267520 triangles from each rank 6 curve and 2391484 triangles from each 
rank 7 curve. 


Remark 2 It is likely that using the box |a;| < 2 would be sufficient. In 
Sect.4, we observe that |a;| < 2 finds all closest pairs of hypotenuses on the 
curves investigated. We use the larger box to conduct a more robust investigation. 
Computing and manipulating these points is by far the most time intensive portion of 
this experiment. With the larger coefficient boxes |a;| < 4, we spend approximately 
32 min on average for each rank 6 curve, and approximately 5h on average for each 
rank 7 curve. In total, we used approximately 940 CPU hours on this part of the 
experimentation. 


3.4 Computing Rank and Generators 


In order to generate triangles for a congruent number f, we require that we can 
compute generators for (the free part of) E;(Q). We perform computations for two 
different sets of curves: those curves E; with t < 1000 and for several chosen curves 
of higher rank. 

For the 1513 curves of higher rank, we use ranks and generators available as 
an electronic supplement to [WDE*14].! We note that not all of these curves 
have explicitly known rank: all have rank at least 6, and 27 have rank 7, but 
Watkins et al. couldn’t verify the ranks of each curve. We do not try to complete 
this verification here, and instead use the 6 known generators for each curve of 
presumed rank 6. Curves corresponding to t < 1000 can be handled within 
SageMath [Sag20]. The methods EllipticCurve.rank and EllipticCurve.gens from 
SageMath works for all but 84 cases. The smallest problematic example is 113. 
We note that the curve £113 is contained within the L-function and Modular Form 
Database (LMFDB) [LMF19]. However, it is also possible to directly use Tunnell’s 
criterion [Tun83] to quickly confirm that 113 is not congruent. Applying Tunnell’s 
criterion on the remaining 83 cases to remove non-congruent numbers reduces this 
to 54 cases. The smallest remaining example is 157. 

The number 157 is congruent and is somewhat famous for its late classification 
by Zagier [Zag89]; the simplest rational triangle with area 157 is extremely 
complicated. 

The techniques SageMath uses to compute the rank and generators for elliptic 
curves are based on John Cremona’s MWRANK [CMP* 19]. SageMath includes 


'The supplement to their paper made this data available in Magma code. We have converted it to a 
SageMath-friendly format and made it available at https://github.com/davidlowryduda/notebooks/ 
blob/master/Papers/largetdata.sage. 
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a copy of MWRANK and a partial interface to MWRANK’s functionality. Calling 
MWRANK directly and allowing up to 2000 seconds of computation time per curve 
provides generators and ranks for an additional 20 cases (including 157). This leaves 
34 congruent numbers up to 1000, the smallest of which is 277. 

To understand the remaining 34 curves, we use functionality from PARI- 
GP [BBB*00], which is also packaged with SageMath. Although MWRANK 
struggles to exactly compute the rank and generators, it can compute that the 
upper bound on the rank is 1 for each of these curves. For the curve E77 for 
example, calling MWRANK through Sagemath with the command EllipticCurve([- 
277*277,0]).rank_bound() shows an upper bound of | for the rank. 

Thus one can use the Heegner point method [Elk94] to find a generator for 
the rational points on these curves. The existence of such a generator confirms 
that the rank is exactly 1. This is implemented remarkably well in PARI. Calling 
pari(EllipticCurve([-277*277,0])).ellheegner() from SageMath quickly returns a 
generator for E277(Q). 


Remark 3 The CAS Magma [BCP97] is also capable of generating ranks 
and generators for these elliptic curves. An anonymous Referee noted that 
Generators(EllipticCurve([-277*277,0])) returns a set of generators almost 
immediately, and functions almost as quickly for all congruent numbers up to 
1000. 


4 Experimental Results 


We performed the strategies outlined in Sect.3 for each squarefree congruent 
number f with t < 1000 and for the 1513 curves of higher rank from [WDE* 14]. 
(See the author’s demonstration github [LD] for a reference implementation to 
obtain and manipulate generators when t < 1000; for higher t, we do similar 
manipulations with the generators found by Watkins et al.) 


4.1 Results for t < 1000 


Out of the 608 squarefree numbers up to 1000, we verified that 327 of them are 
congruent numbers and produced several triangles for each one. Of these, 274 
corresponding to curves with rank 1 and 53 to curves with rank 2. We note that 
this is consistent with the celebrated conjecture of Goldfeld [Gol79], which implies 
that although there may be infinitely many such curves of rank at least 2, these 
should be sparse and correspond to 0% of all congruent numbers in natural density. 
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Among all these computed triangles, we found no counterexample to the 
conjecture. There were no two dissimilar right triangles with the same hypotenuse 
(after scaling to a primitive right triangle) corresponding to the same congruent 
number. 

Recall that we computed at least 300 triangles from each curve of rank | and 
at least 11,400 from each curve of rank 2. Per the heuristic in Sect. 3.2, we would 
expect that pairs of points corresponding to triangles with the close hypotenuses 
should come from points of small height. 

This is true for all the curves tested for t < 1000. Further, the nearest pairs 
of hypotenuses among triangles associated to the same curve were always among 
the smallest hypotenuses. In terms of our generators {g;}, the smallest pair of 
hypotenuses always came from triangles corresponding to the points }° ajg; with 
la;| < 2. (Although there are not canonical choices of generators, we note that 
MWRANK returns generators with small naive height [Cre97].) 


Example 4 On £¢(Q), whose free part is generated by (—3, 9), the two nearest 
hypotenuses come from the triangles (3, 4,5) and (49, 1200, 1201), which come 
from g and 2g respectively. 

For t = 34, we have that rank(E 34 Q) = 2. The free part of E34(Q) is generated 
by the points gj = (—16, 120) and gz = (—2, 48). The two closest hypotenuses 
come from the triangles (225, 272, 353) and (17, 144, 145), corresponding to gj 
and go respectively. 


This adds support for our choice of box |a;| < 4 for points on higher rank elliptic 
curves. 

Although no pair of hypotenuses exactly matched, we can ask about how close 
two hypotenuses could be. Of those congruent numbers t < 1000, we determined 
that 5 have pairs of dissimilar triangles with hypotenuses that differ by less than rf: 
210, 330, 546, 609, 915. All five of these come from rank 2 curves. 

The curve F210 has a particularly close pair of triangles whose hypotenuses differ 
by 8. This is the pair of triangles (12, 35, 37) and (20, 21, 29), which correspond to 
the two generators for the free part of E219(Q). 


Smallest Hypotenuses Further, we took this opportunity to collect data on the size 
of the smallest hypotenuse for each congruent number < 1000. In Fig. 1, we plot 
squarefree congruent numbers ¢ against the log of the smallest hypotenuse of a 
primitive right triangle with squarefree part of the area equal to f. 

Within the plot, the first congruent number f with smallest hypotenuse greater 
than e!° is Zagier’s number, 157. The largest smallest hypotenuse we computed 
corresponds to the congruent number t = 997, and is approximately e*4?. 

Many smallest-hypotenuses appear to be moderate in size, but no consistent 
pattern emerges. 
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Fig. 1 Smallest hypotenuses for congruent numbers ¢ < 1000 


4.2 Results for Curves of Rank 6 and 7 


We produced hundreds of thousands of triangles for each of the 1513 curves of high 
rank. We found no counterexample to the conjecture. 

As above, we tracked closest pairs of hypotenuses for each curve. Even though 
we computed all rational points )~ a; g; in the coefficient box where each coefficient 
satisfies the bound |a;| < 4, we again observed that for each ¢, the closest pair 
of hypotenuses came from points in the smaller box |a;| < 2. The heuristic from 
Sect. 3.2 appears to continue to hold, indicating that nearest pairs of hypotenuses 
occur among the points of the least height on the curve. 

We consider how close hypotenuses from two different triangles could be. Of all 
the 1513 higher rank curves investigated, 375 has two hypotenuses that differed by 
less than the corresponding congruent number rf. 

In absolute terms, the closest pair came from t = 6611719866 ~ 6.611 x 10°, 
which has triangles with hypotenuses 30544225 and 67119265, differing by about 
3.657 x 10’. This is very large. We also noted a pair for the congruent number t = 
1902736244939034 ~ 1.9027 x 10!5, which has triangles with hypotenuses given 
approximately by 1.061 x 10!° and 1.109 x 10!°, differing by about 4.769 x 10°. 
These are also the two smallest hypotenuses of triangles for t. Although this is also 
very large, this is significantly smaller than both ¢ and the smallest hypotenuse. 

Both of these “near-miss” examples came from rank 6 elliptic curves. 
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Smallest Hypotenuses We again took this opportunity to collect data on the size 
of the smallest hypotenuse coming from each of the 1513 curves of higher rank. As 
the numbers are encompass a vastly larger domain, we present this in two log-log 
plots. 

In Fig. 2, we show smallest hypotenuses for each ¢ from curves of (presumed) 
rank 6. We first explain the lack of points in the upper-right portion of the plot. This 
is caused by the methodology used to generate the 1513 curves in [WDE* 14]—they 
performed an almost-exhaustive search for high rank elliptic curves for all t < 2°°, 
while for higher ¢ they only looked for curves with points of relatively low height. 
As 2°° ~ e4° and as the hypotenuse / corresponding to a point P satisfies the 
bound h > H(p)/2t, this explains the missing region of data. (Although we found 
it surprising that this plot reveals the choices of bounds used to find the elliptic 
curves so clearly.) 

Comparing Fig.2 with Fig. 1, we note the disparity in size. Every hypotenuse 
coming from a higher rank elliptic curve was smaller than the smallest hypotenuse 
on E157. On the other hand, each smallest hypotenuse was larger than el, 

In Fig. 3, we show smallest hypotenuses coming from curves of rank 7. As with 
the previous plot, there are patterns among the curves that result from the way in 
which the curves were initially found. In [WDE* 14], they performed an extensive 
search for curves of rank 7 for t < 2% ~ e4!-6, and for higher t they only looked 
for curves with rational points of low height. Although it is less obvious in this plot, 
we note that we should expect that there is a lot of missing data in the upper right 
portion of this plot. 
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Fig. 2. Smallest hypotenuses from (presumed) rank 6 curves 
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Fig. 3. Smallest hypotenuses from rank 7 curves 


Appendix: Invariants of the Lowry-Duda Moduli Space 
by Brendan Hassett 


We verify that the moduli space X introduced by Lowry-Duda admits a resolution of 
singularities X = X thatis of general type and simply connected. Since XQ) # G, 
existing Diophantine techniques (like Faltings’ Theorem [Fal94]) shed little light on 
the structure of the rational points on X. 


Double Cover Realization 


Let 


3 1 
XC Pee ry X Praag 


be the surface over Q defined by the equations 
s*+r2= S247? 
Css Pa WV ST(S = 7"). 


The first equation defines a surface Y C P? isomorphic to P! x P!. Projection onto 
the first factor induces a morphism wa : X — Y that is generically finite of degree 
two. The branch curve 
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B := {st(s* —t’)ST(S? —T?) =0} CY 


is a union of eight planar sections of Y, each a smooth conic curve defined over Q. 
The morphism @ fails to be flat over the locus 


Z := {st(s* —t*?) = ST(S* —T*) =O} CBCY 
consisting of the following 32 points: 


[0, 1,0, £1] = {s = O}N{S = 0} 
[0, 1, +1, 0] = {s = O0}N{T =0} 


(0, +V2, 1, 1] = {s = O}N{S—T =0} 


(0, +V2, 1, -1] = {s =O} N{S+T =0} 
[1, 0,0, £1] = {tf =O} N{S =0} 
[1, 0, £1, 0] = {¢f =0}N{T =0} 


[+V2, 0,1, 1] ={t =O}N{S—T =0} 


[+V2,0, 1, -l] = {ft =O}N{S+T =0} 


[1, 1,0, +V2] = {s —-t =0}N{S=0} 


(1, 1, +72, 0] = {s —t =O} N{T =O} 
(1, 1,41, +1] = {s —t =0}N{S—T =0} 
[1,1,41,=1)={(s -F=0}n{S4+T =0) 


[1, -1,0, £V2] = {s +4 =0}N{S =0} 


(1, -1, +V2, 0] = {s +r =O}N{T =0} 
[1, -1, +1, £1] ={s+t=0}N{S—T =0} 
(peat Filles penises rol 


Points of Z are nodes of B. The double cover of Y branched over B has Aj 
singularities over these 32 nodes which are automatically resolved in X. 
The remaining points of intersection among components of B are: 
[0,0, 1, +i] = {s =O} N {t =O} N{s —t =O} N{s4+r=0} 
[1,+7,0,0] = {S$ =O} N{T =O} N{S—-—T =O} N{S+T =0} 
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Write 


W := {[0, 0, 1, +], [1, +7, 0, OJ}, 


where B has multiplicity-four singularities. There are (:) = 28 pairs of conics 
altogether so all intersections are accounted for. 


Resolving the Bad Singularities 


We briefly review background on surface singularities: A double cover of a smooth 
surface branched along a curve has ADE singularities if and only if the curve has 
simple (ADE) singularities. An isolated hypersurface singularity s ¢ S is Du Val if 
there exists a resolution 6 : S — S such that K 3 — B* Ks is effective. Complex Du 
Val singularities coincide with the ADE surface singularities [Rei97, §4.20]. Given 
a projective surface S with Du Val singularities, pluricanonical differentials on S 
are the same as pluricanonical differentials on a resolution S.In particular, if Ks is 
ample then S is of general type. 

Returning to X, we seek to resolve the singularities that are not Du Val, which 
are associated with the multiplicity four singularities of B along W. We observe that 


Wc {s? +2? = $747? = 0}, 


i.e., the vertices of a cycle of four rational curves contained in Y, whose complement 
is a torus. The blow-up 


Y :=Blw(Y) > Y 


is therefore toric with boundary consisting of an octagon of P!’s with self- 
intersections alternating between —1 and —2. The induced blow-up 


B:X—7>xX 


resolves the singularities of X. 


Canonical Class of X 


First, we choose a basis for the Picard group of Y: Let fi and f> be the rulings of 
Y~P! x P! and Ej,..., Ea the exceptional divisors over W so 


Pic(Y) = (fi, fo, E1, Eo, E3, Ea). 
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We order so that the boundary octagon has sides with classes 
Ej, fi = Ey = £9, Bay jo = Ex= Ey, Bs, fu £9 = £4, Ea, fo — fa = £1, 
The canonical class 
Ky =—2f, —2fo+ FE, + Ex + £3 + Eq. 


The proper transform B of B—noting the multiplicity four singularities along W— 
has class 


8(fi + fo) — 4(E1 + E2 + E34 E4); 
B is disjoint from the four (—2) curves in the octagon. Blowing down these four 


curves induces a birational morphism 


You 


to a quartic del Pezzo surface with four A; singularities. This morphism is induced 
by the linear series |2 f, +2 fo — E, — Ez — E3 — E4| corresponding to the quadratic 
equations for W modulo the defining equation for Y. 

We refer the reader to [BHPVdV04, IV.22] for a discussion of the invariants of 
double covers of surfaces branched along curves with simple singularities. 

The canonical class of X is obtained by pulling back 


1~ 
Kyte SAU Pe) CE ose Ba) 


through the induced morphism X — Y. The image of Y and X under this linear 
series equals &. We summarize this in the following proposition. 


Proposition A The surface X is of general type with K é = 8, and the linear series 
|K ¥| induces a morphism 


X> Dd 


that is generically finite of degree two. 


In particular, we find 


eo lr 
diml'(Ky) = dimP(Y, Ky + a B) = 5. 
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X Is Simply Connected 


To compute the other invariants of X, we use observations following immediately 
from simultaneous resolution of ADE singularities [Bri68]: X is deformation 
equivalent to a double cover of Y branched along a generic member of 


|B] = |8(fi + fo) — 4(E1 + Ex + E3 + Ea)|, 


disjoint from the (—2)-curves. Smoothing the A, singularities of & and noting that 
the branch divisor lies in |—4K y|, we find that X is also deformation equivalent to a 
double cover of a quartic del Pezzo surface X’ branched over a divisor A € |—4Ky)|. 
Such surfaces are simply connected by the Lefschetz hyperplane theorem, hence 


m(X) = {1}. 


The Remaining Invariants 


The branch curve A satisfies 


deg Ky = 48 


by adjunction on &’. Thus we conclude 


x(X) = 2(x(B') — x(A)) + x(A) = 2(8 + 48) — 48 = 64, 


bo(X) = 62, and hl (Qk) = 52. 
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David Lowry-Duda 


Abstract We examine several currently used techniques for visualizing complex- 
valued functions applied to modular forms. We plot several examples and study the 
benefits and limitations of each technique. We then introduce a method of visual- 
ization that can take advantage of colormaps in Python’s matplotlib library, 
describe an implementation, and give more examples. Much of this discussion 
applies to general visualizations of complex-valued functions in the plane. 


1 Introduction 


1.1) Motivation 


Graphs of real-valued functions are ubiquitous and commonly serve as a source 
of mathematical insight. But graphs of complex functions are simultaneously less 
common and more challenging to make. The reason is that the graph of a complex 
function is naturally a surface in four dimensions, and there are not many intuitive 
embeddings available to capture this surface within a 2d plot. 

In this article, we examine different methods for visualizing plots of modular 
forms on congruent subgroups of SL(2, Z). These forms are highly symmetric 
functions and we should expect their plots to capture many distinctive, highly 
symmetric features. 
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In addition, we wish to take advantage of the broader capabilities that exist 
in the Python/SageMath data visualization ecosystem. There are a vast number 
of color choices and colormaps implemented in terms of Python’s matplotlib 
library [Hun07]. 

In matplotlib, a colormap is a Python data type that associates a color to an 
input real number. While many colormaps are purely aesthetic, others are chosen 
to allow the most accurate perception by as many viewers as possible. By using 
established matplot1lib colormaps, it is possible to choose colors to support 
different types of color perception, including for people who see color differently. 
We describe this further in Sect. 3. 

Although our emphasis is on plots of modular forms, many of the techniques 
described apply more generally to graphing complex-valued functions. We hope 
this paper will be of help to any reader wanting to visualize complex functions. 


1.2. Broad Overview of Complex Function Plotting 


Over the last 20 years, different approaches towards representing graphs of complex 
functions have emerged. The most commonly used approach is one first introduced 
by Frank Farris in a review [Far98] of Needham’s Visual Complex Analysis. 

The idea is to represent the output of a complex function though color. Fre- 
quently, one uses hue to represent argument and lightness to represent magnitude. 
Or more generally, one can associate a color to each point in the complex plane and 
then color the domain of a complex function f by the color of f(z). This technique 
is now called domain coloring. Using hue and lightness to represent argument and 
magnitude is the default complex plotting method in SageMath [Sag20], as well as 
other common complex plotting libraries. 

In principle, such a graph gives perfect information about the function. But in 
practice, it is often difficult to distinguish between hue and lightness — and in 
particular it is challenging to determine if two points with different hues have the 
same lightness. Thus some variations omit a dimension from the graph and plot only 
the magnitude or phase. Even though the resulting graphs do not perfectly capture 
f, the omission of a dimension can be beneficial. Other variations specially color 
the complex plane in other ways. 

Wegert [WS10] describes an approach using domain coloring, but emphasizing 
phase plots as a visual tool. He also produces phase plots with small brightness 
adjustments near certain magnitude thresholds. The effect is similar to looking at 
a contour map of a landscape, except that each pixel’s color carries meaning (the 
phase of f evaluated at that point). 
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In Sect. 2, we give a short survey of common existing complex plotting techniques 
applied to modular forms. We include images coming from the ideas of Farris and 
Wegert, built using SageMath. We also reproduce the plots associated to modular 
forms on the L-Function and Modular Form Database (LMFDB [LMF19]). 

In Sect. 3, we describe how one can incorporate pre-established colormaps into 
modular form visualizations. And in Sect. 3.1, we discuss one such implementation 
in terms of matplot1lib. This is a new implementation. 

Throughout, we revisit the same basic figures with each visualization technique. 
We plot four different modular forms on three different domains in each technique. 
Each plot reveals different behaviors of the underlying forms. 


2 Survey of Visualizations 


2.1 Halfplane and Disk Models 


Plots of modular forms typically represent the form either on the upper half-plane 
or on the Poincaré disk. We first describe the relationship between these two types 
of plots, and then survey different visualizations of forms in each plot type. 

Let H = {x + iy: y > 0} denote the complex upper halfplane. A modular form 
of weight k on a congruence subgroup [ C SL(2, Z) is a holomorphic function 
f:H— C satisfying 


fa =(eztak fi) VWe=(25)eP, (1) 


as well as certain growth conditions at the cusps of '\H. See [DS05] for complete 
definitions. 

To the congruence subgroup I is associated a positive integer N called the level. 
The automorphy condition (1) implies strong symmetry conditions on f in terms of 
N. In particular, ic ) € I implies that f will be periodic with period N. Thus one 
can constrain a plot to a vertical strip of width N without omitting any information. 

In practice, one would cut off the vertical strip at some height H. If f is a 
cuspform, so that limy-,o. f(x + iy) = O, then heuristically most interesting 
behavior of the form will lie within a well-chosen box [0, N] x [0, H]. In practice, 
N is typically much larger than H, and it’s necessary to investigate some smaller 
box. We refer to this type of plot as a plot of f in the upper halfplane. 

Alternately, the upper halfplane is conformally equivalent to the Poincaré disk 
= {x + iy :x*+ y* < 1}. There are infinitely many such maps taking D to H, 
but we choose the mobius transform 
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1—iz 


o(z) = Pi (2) 


Under ¢, the points —i, 0, i in D are mapped to 0, i, oo, respectively, in 7. Thus the 
apparent vertical orientation remains fixed in both models. 

Plotting a modular form f on D is a complete picture; values of f at every point 
of H will be represented through such a plot. We refer this type of plot as a plot of 
f in the disk. 


2.2 Visualizations 


Let A(z) denote the Ramanujan Delta function, the unique holomorphic cuspform 
of weight 12 on SL(2, Z). And let g(z) denote the unique holomorphic cuspform 
of weight 4 on (5). We think of g as a “simple” modular form, and A is perhaps 
the most widely-recognized modular form. Each of these forms is catalogued in the 
LMFDB [LMF19], referenced by a label. The A function has label 1.12.a.a.1.1 and 
g has label 5.4.a.a.1.1. 

We also consider two more complicated modular forms: f05, a weight 2 modular 
form on I'(105), identified by label 105.2.a.a.1.1 and fio(z), a weight 20 modular 
form on I'(10), identified by label 10.20.a.a.1.1. 

We should expect forms of similar weight to share characteristics, and we 
should expect forms whose levels have similar numbers of cusps to share some 
characteristics. Here, we have chosen levels that correlate strongly with the number 
of cusps. Thus the two forms of smaller level have fewer distinct cusps than the two 
forms of larger level. 

Note that g(z) has small level and small weight, A(z) has small level and 
moderately large weight, fio05 has very large level and very small weight, and fio 
has moderately large level and high weight, 

In the plots that follow, we will show these four forms in the order 
(g, A, fios, fio) from top to bottom. To produce the plots, we approximate each 
modular form by its first 400 Fourier coefficients and evaluate the function on a 
600 x 600 grid, and store it as a PNG. We plot each form on the Poincaré disk, 
on a natural box in H, and then on a “zoomed-in” portion of H. Specifically, all 
“zoomed-in” plots on H are of the region [0.1, 0.4] x [0, 0.25]. For the other plots on 
H, we plot A on [—1, 1] x [0, 2], g on [—2.5, 2.5] x [0, 2], fios on [—1, 1] x [0, 1], 
and fi9 on[—1, 1] x [0, 2]. 

The region choices for A and g are natural: these are each wide enough to include 
at least one period, and tall enough to capture most interesting properties. (This 
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will be apparent in the graphs.) For fios and fj, it is not practical to plot a box 
sufficiently wide to capture a period. Instead we plot a smaller region. 

We plot each of these four forms on each of the three regions with a variety of 
different visualizations and discuss each in turn. 


Remark 1 We do not intend to emphasize one type of visualization as being univer- 
sally better than all the others. As we will see, each type has flaws and only presents 
a single view. Nonetheless, at the end of paper we give our recommendations for 
producing visualizations. 


2.2.1 Standard Domain Coloring 


In Fig. 1, we present default plots (as created by complex plot in SageMath). 
In these plots, the arg z is represented by hue and |z| is represented by brightness, 
where 0 is black and 00 is white. 

The fact that each form plotted is a cuspform is immediately clear as each plot 
on the disk is dominated by the color black (corresponding to the overall size being 
small). Further, a closer look will reveal that each cusp appears to be visibly dark. 
This is most obvious in the two lower level forms, g and A, where each cusp has a 
relatively large dark region around it. As the number of cusps increases, the behavior 
around cusps becomes more varied. Very near the boundary, our approximation for 
the form is less accurate. This is one factor contributing to why the cusps of {9 are 
far less pronounced. 

The weight of each form manifests primarily in how quickly the form can go 
from very bright to very dark. Here, the two forms of higher weight, A and fj, 
have regions of near-white brightness near the boundary that very rapidly darken 
towards the top of the disk. The two forms of lower weight are much more even in 
brightness. The high weight of f19 is another factor contributing to why its cusps 
are less pronounced: there are smaller regions of darkness around each cusp. 

It is possible to adjust the mapping from magnitude to brightness, but in practice 
these plots look roughly the same for a wide choice of maps and it takes quite a bit 
of tuning to choose an appropriate map.! 


' Adjusting the map from magnitude to brightness for SageMath’s complex_plot is possible, 
but nontrivial; the map is defined internally and the current plotting interface doesn’t give any 
option to choose or alter this map. In order to adjust this map, it is necessary to modify the source 
for SageMath’s plotting routines directly. 
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Fig. 1 Default plots using complex plot in SageMath. From top to bottom, the forms are 
g, A, fios, and fio 


2.2.2 Magnitude Plot Without Color 


We now produce plots after ignoring the phase of f(z) and instead only using the 
magnitude | f(z)|. There remains a choice of how to map magnitude to brightness; 
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Fig. 2. Magnitude plots where the magnitude corresponds to darkness. Black corresponds to zero 
magnitude. From top to bottom, the forms are g, A, fios5, and fio 


it is common to use maps of the form arctan(log(| f(z)|* + 1)) for some 0 < a < 1. 
Taking a = 0.25 yields the plots in Fig. 2. We note that adjusting the power a is the 
most natural and clearest way to adjust the overall darkness/lightness scale of this 
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style of plot: larger alpha causes points near zeros to be much darker and points near 
poles to be much lighter; smaller alpha decreases this effect. 

Qualitatively, each of these pictures offers very similar information as those 
in Fig. 1. The lack of phase information makes it slightly more challenging to 
distinguish between the images of A and g in the disk. But note that it is very easy 
to determine forms of low weight and high weight, as the forms with low weight 
have much slower changes in brightness. 

The complicated and varied behavior of f19 near the boundary is more evident in 
this plot. The presence of many more points of small size is made far more obvious. 
It is just barely possible to observe the region of decreasing size around the 0 cusp. 


2.2.3 Magnitude Plot with Periodic Linear Color 


In the LMFDB, magnitude is mapped periodically to color.” That is, hue represents 
the magnitude modulo 1, where blue is zero, and increases through purple, red, 
orange, yellow, and so on. In Fig. 3, we mimic the plots in the LMFDB attained by 
ignoring the phase of mapping the magnitude mod | to hue. 

Recall from Sect. 2.2 that each of the four forms we are considering has a label 
on the LMFDB, and we gave links to their homepages. Each page has a medallion 
with a plot similar to the plots on the Poincaré disk given here. 

The large bluish blob corresponds to a region where the magnitude of the form 
is never more than |. The larger weights of A(z) and fj correspond to more rapid 
growth away from cusps, leading both to smaller bluish regions and more tightly 
packed lines; fg has larger weight, and both of these traits are even more prominent 
in its graphs. 

For the lower weight forms g and fos, this is a very accurate plot. But A(z) 
and fi9 change too rapidly near the boundary, causing a static-like appearance. 
Increasing resolution helps, but in practice it is not feasible to produce sufficiently 
high resolution images to completely make sense of the boundary with this type of 
plot. 

This sort of plot is very well-suited for low weight forms of any level. The small 
weight means that even at moderate resolution, the image will appear well-defined 
even near the boundary. Then the cusps associated to the level will be visually 
apparent by the varying types of behaviors near the boundary. This is most obvious 
in the detail images for g and fjo95, where we can clearly see the greater variety in 
behavior near the cusps at the boundary in plots of /{95. 


?In the period between writing and publishing this paper, the LMFDB changed its method for 
plotting modular forms to incorporate methods described in this paper. Thus in the rest of this 
paper, LMFDB-style plots refer to the style of plots in the LMFDB on October 1, 2020. 
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Fig. 3. Magnitude plots mimicking the style of the LMFDB. The phase is ignored; the hue 
represents the magnitude modulo 1. From top to bottom, the forms are g, A, fi95, and fio 


2.2.4 Magnitude Plot with Periodic Logarithmic Spacing 


In Fig. 4, we change yet again the map from magnitude to color. We now allow the 
hue to represent log, (| f(z)|) mod 1 for some base w. Thus two consecutive bands 
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Fig. 4 Magnitude plots. The phase is ignored; hue represents the log of the magnitude, modulo 1. 
From top to bottom, the forms are g, A, fi05, and fio 


of red will correspond to the points in one band being a times the size of the points 
in the other. 

We note that adjusting the factor between consecutive bands (i.e. choosing the 
base of the logarithm in the map from magnitude to color) controls the number of 
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spacing of bands, and thus it is possible to make the plots visually simpler or more 
complex by experimenting with this parameter. The plots in Fig. 4 were produced 
with a = 7. 

The effect in these plots is similar to giving a higher resolution of Fig. 3. A large 
amount of detail and data is visible within these plots; however this is double-edged, 
and the plots are now busier. The high growth rate in the forms of higher weight is 
indicated by many more tightly packed bands of color near the boundary. 

Note that there is a region where the resolution appears insufficient in the detail 
plot for A(z). This is actually caused by small errors in the approximation for A(z) 
interfering with the extremely tiny values of A(z) there. As can be seen from the 
nearly black portions of Figs. 1 and 2, A(z) is extremely small in this region, and 
thus even small errors in the approximation produce noticeable artifacts in the graph. 

These plots are particularly useful for plots on H, where even for high weight 
the plots appear mostly well-formed. In the disk, the rapid decay at the top of the 
disk (at the co cusp) causes tight concentric circles of color and attracts a lot of 
visual attention, even though the actual behavior is mostly regular and relatively 
uninteresting there. 


2.2.5 Pure Phase Plots 


Following the ideas of Wegert [WS10], we now consider phase plots. In Fig. 5, 
we ignore magnitude completely and only consider the phase. There is a natural 
mapping from phase to hue: we allow hue to represent arg(z)/2z77, and the natural 
periodicity of phase works well with the natural periodicity of the color wheel. 

Qualitatively, less behavior of each modular form is evident in its phase plot 
in comparison to its magnitude plot. This is partly due to the lack of poles and 
infrequent zeros away from the boundary. Poles and zeros are singularities in phase 
plots, and the behavior of the colors around these points describes the order and 
nature of these points. 

In our opinion, pure phase plots should not be used on their own to study or 
describe modular forms. However, pairing each image in Fig. 5 with its correspond- 
ing image in Fig. 3 or Fig. 4 completely represents the modular form on the indicated 
domain. 


2.2.6 Phase Plots with Magnitude Contours 


Again following Wegert [WS10], we create phase plots with contours representing 
magnitude in Fig.6. We indicate the contours by adjusting the lightness: below 
each contour we adjust the lightness to be darker and above each contour we adjust 
the lightness to be lighter. The sudden change from darker to lighter indicates the 
contour itself. 

The phase naturally maps to hue as in the pure phase plots in Fig. 5. Contours 
could be chosen with spacing defined linearly as in Sect. 2.2.3 or logarithmically 
as in Sect. 2.2.4. In these plots, we have chosen logarithmic spacing with respect to 
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Fig. 5 Phase plots. Magnitude is ignored, and hue represents the phase. From top to bottom, the 
forms are g, A, fos, and fio 


log». That is, two consecutive contour lines indicate that the points along one have 
twice the magnitude of the points along the other. 


Remark 2 We emphasize that it is not necessary to compute the contours them- 
selves. Instead, one defines a coloring of the complex plane that includes brightness 
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Fig. 6 Phase plots with magnitude indicated along contours. Away from the contours, magnitude 
is ignored. From top to bottom, the forms are g, A, fios, and fio 


adjustments along the desired contours. In this case, we defined brightness adjust- 
ments around points of magnitude 2” for integers n. Then simply plotting the 
function with respect to this domain coloring causes the contours to appear. This 
is a key idea why approaches based on domain coloring are simple to compute. 
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The plotting program used to produce these plots is available at [LD20]. 


These plots carry all the information visible in the pure phase plots of Sect. 2.2.5 
and some of the information contained within the logarithmically spaced magnitude 
plots of Sect. 2.2.4, but these two sets of information are not displayed equally. The 
colors grab attention far more than the brightness adjustments around contours— 
analogous to the lightness difficulties for the standard domain coloring plots of 
Sect. 2.2.1. These plots emphasize argument more than magnitude. 


3 Choosing Color 


We surveyed many different visualizations in Sect.2, but one thing remained 
constant: the choices of color. In most complex function visualization packages, 
there is only one color choice available. All visualizations in Sect. 2 were created 
using the only color scheme for complex plot available in SageMath.* 

Unfortunately, this color scheme has flaws. For one example, it has uneven 
perceptual contrast: it has points of locally high color contrast (giving the impression 
of additional local variation) and points of locally low color contrast (hiding local 
variation). Thus in Fig.5, for instance, green is particularly visible and the yellow- 
green border is particularly striking, even though there is no underlying feature that 
this identifies. Further, in the plots in the upper halfplane, the vertical bands of color 
seem to favor green-yellow at the expense of purple-blue. 

In his excellent article [Kov15], Kovesi describes this problem and details how to 
create colormaps that are perceptually uniform — that don’t have uneven perceptual 
contrast. His work led to the creation of new default colormap, called viridis, for 
Python’s matplot1ib plotting library. 

More recently, Nufiez, Anderton, and Renslow wrote an article [NAR18] describ- 
ing their Python module cmaputil, which helps make perceptually uniform 
colormaps that additionally support perception by those with various forms of color 
vision deficiency. They created the Python colormap cividis (obtained by modifying 
viridis, hence the name), which they show is a good colormap for those with reduced 
sensitivity to red or green light. It is possible to make other colormaps suitable for 
other types of perception using this tool. 

We have incorporated matplotlib colormaps into our plotting routines. In 
particular, we implemented phase plots and phase plots with contours using these 
colormaps. In this section, we describe this implementation and give examples of 
the resulting plots. 


Remark 3 SageMath and matplot1lib share many plotting routines, and newer 
versions of SageMath will default to these new perceptually uniform colormaps 


3The behavior in Matlab and Maple appears similar, though it is possible to define a color scheme. 
On the other hand Mathematica has an extensive library of complex plotting color schemes. 
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for real-valued plotting. But SageMath’s complex plot does not directly use 
matplotlib. We are working towards making these new plotting routines easy to 
use and publicly available, and possibly incorporating them into SageMath. 


3.1 Implementation Overview 


Some of the details of this implementation are specific to Python and SageMath, but 
the overall idea readily generalizes. 


(1) Begin with a desired complex function f, a region Q C C, and a desired 
colormap cm. 

Evaluate arg(f(z)) and | f(z)| for each point in an M x WN grid containing Q. 
(Optionally mask points (m, 1) in this grid outside of Q to avoid unnecessary 
computation. For example, to produce the plots of A and g in the disk, we mask 
points z with |z| > 1.) This gives an M x N x 2 array. 

Map each phase to [0, 1]. This is most often done through the natural map 6 +> 
(6/27) mod 1, but note that it is possible to add an offset © through @ bt 
(6/22 + ©) mod |. This might be done to choose which phase corresponds to 
the endpoints of the colormap. (See plots in Fig. 9 for examples). 

Each matplot1lib colormap is implemented as a function cm : [0,1] — 
RGBA, where RGBA = [0, 1]* is red-green-blue-alpha colorspace. We don’t 
currently use the alpha channel. Thus apply cm to each mapped phase and 
discard the alpha channel to get RGB. 


(2 


wm 


(3 


wm 


(4 


wm 


If the intention is to produce a pure phase plot, then (after discarding the 
magnitude) one now has an M x N x RGB array, and one now produces a plot 
using this data in this grid, either directly or using a plotting library. For example, 
matplotlib’s method matplotlib.pyplot .imshow will directly plot this 
array. 

However, if the intention is to produce a phase plot with (logarithmically spaced) 
magnitude contours, then additional work is necessary. We would like to adjust the 
brightness as done in Sect. 2.2.6, but the brightness of an RGB pixel is nontrivial to 
work with directly. 

Instead we convert each pixel to HSL colorspace (hue-saturation-lightness) 
and adjust lightness. As most plotting libraries use RGB (as does the underlying 
hardware), we convert back to RGB prior to plotting. Thus to produce a phase plot 
with magnitude contours we perform these additional steps. 


(5) Map each RGB element to HSL colorspace, where HSL refers to “hue- 
saturation-lightness”. In HSL, it is possible to directly manipulate the lightness. 
This gives an M x N x 3 array. 

(6) Map each magnitude to a lightness adjustment in the range [—1, 1]. For 
example, to have logarithmic contours, one could use a map of the form 
(log, (| f(z)|) mod 1)/2 with a fixed lightness adjustment in a neighborhood 
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of 0. This has the effect of creating logarithmically spaced contours. This gives 
an M x N x | array. 

We note that we refer to x mod | to mean the least nonnegative number of 
the form x + £ for € € Z, i.e. the “computer science” modulo operation. 

(7) For each (m,n), apply the corresponding lightness adjustment to each HSL 
value. (It may be necessary to renormalize lightness at this step.) This gives a 
single M x N x 3 array. 

(8) Map each HSL back to RGB to get the final M x N x 3 array. 


And now one can directly plot this array or use a library like matplotlib to 
plot this array, as noted above. 


Remark 4 Each step can be done very quickly if one uses vectorized array 
operations as in NumPy [Oli06]. We also note that Python’s colorsys library 
implements NumPy-vectorized conversions between RGB and HSL. Then the most 
time-consuming step is then the evaluation of f on the M x WN grid, as should be 
expected. 


3.2 Example Visualizations with Colormaps 


We now give example visualizations with matplot1lib colormaps. The three 
mostly commonly used matplotlib colormaps are currently twilight, viridis, 
and cividis. Each of these maps is perceptually uniform and designed for clean 
presentation of data. 

The twilight colormap is also cyclic. This means that the two ends of the 
colormap line up perfectly, and thus it is a good colormap to represent phase in phase 
plots. The viridis and cividis colormaps are not cyclic. There is a sharp difference in 
colors at the two ends of the colormap. Plotting a modular form using either viridis 
or cividis thus highlights the points where the phase crosses over the end of the 
colormap. 

In some cases, this may be undesirable. For exploratory visualization, using a 
cyclic perceptually uniform colormap often seems superior. But more generally it 
is sometimes desirable, or even advantageous, to emphasize certain aspects of a 
complex plot, and using a non-cyclic colormap to highlight phase-changes is just 
one possible method. 

In Fig. 7, we plot the modular form g in four different color schemes: the default 
in SageMath, and the three matplot1ib colormaps twilight, viridis, and cividis. 
In contrast to the default color scheme, these three colormaps produce a plot that 
is more balanced in appearance. In particular, the perceptually uniform plots don’t 
overemphasize regions like the bright green and yellow portions in the default color 
scheme. 

Of course, perceptually uniform doesn’t mean perfect. In the twilight colormap, 
there is a sensation of peaks and troughs resulting from the light and dark areas of 
the plots; and as both viridis and cividis aren’t cyclic, the sharp color change at the 
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Fig. 7 Phase plots of g with four color schemes. The top left is the default color scheme of 
SageMath. The top right is twilight, bottom left is viridis, and the bottom right is cividis 


edges of the colormap gives the impression of contours. These are features of the 
colormap, not of the underlying function. 

In Fig. 8, we give phase plots in the cividis colormap. For half of the plots, we 
include logarithmically spaced contours, implemented as described above. These 
plots should compared with the phase plots in Fig. 5 and the contoured phase plots 
in Fig. 6. 

In these images, we have mapped the arguments to the colormap in such a way 
that the sharp change in color from light yellow to dark blue occurs when the phase 
is 2, or equivalently when the value of the modular form is a negative real number. 
On the upper halfplane, a modular form with real coefficients will take real values 
on vertical lines z = 5 + it forn € Z,t > 0. This explains the regularly spaced 
lines in the plots. 

On the one hand, this gives a strong and simple visual indicator of phase change. 
The regularity of this phase change is more obvious in these plots than the previous 
phase plots in the default color scheme. But on the other hand, this strongly 
emphasizes a small subset of phases. 
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Fig. 8 Phase plots with the matplot1ib perceptually uniform colormap cividis applied. In half 
of the plots, logarithmically spaced contours are included. From top to bottom, the forms are 
8, A, fios, and fio 


We note also that the colormap cividis is both visually appealing and widely 
accessible. 

In Fig. 9 we have included plots in a variety of other matplot1ib colormaps. 
For the top six plots, we used the three colormaps viridis, twilight, and coolwarm. 
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Fig. 9 Phase plots with a variety of colormaps from matplot1ib applied. From the top left and 
going row by row, the (function, colormap) are (A, viridis), (A, twilight), (A, an offset coolwarm); 
(fio, viridis), (fio, plasma), (g, twilight); (fios, coolwarm), (fios, inferno), (g, Paired) 


Fig. 10 Two visualizations of A 


Note that in the middle plot in the second row, we have normalized the non-cyclic 
colormap coolwarm so that the sharp distinction between red and blue occurs when 
the argument of A is 37/2 instead of zr. In the bottom row, we applied the colormaps 
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inferno, plasma, and Paired. Both inferno and plasma are also perceptually uniform, 
but are very orange. The colormap Paired is a discrete colormap, which has the 
effect of coloring different regions of the complex plane based on discrete argument 
sectors. 

While each plot in Fig. 9 may not be appropriate for initial exploratory visualiza- 
tion, they demonstrate the ability to choose colormap to highlight certain behaviors 
or areas of the function. And most importantly, they demonstrate that implementing 
matplotlib colormaps gives access to an existing, very large library of color 
schemes. 


Remark 5 Choosing color schemes to highlight certain behaviors is not a new 
concept. For example, Frank Farris [Far15] has produced several pieces of art based 
on domain colorings. 


Finally, we note that it is possible to apply these colormaps to the previous 
styles of visualization. For example, we can produce a colormapped version of the 
default SageMath plot (as in Fig. 1) by changing steps (5)—(8) in the implementation 
described in Sect. 3.1; map the phase to RGB as described there, but map magnitude 
to brightness as in Sect. 2.2.1 or 2.2.2. 


A Strategy for Making Visualizations Having given examples of several different 
types of complex plot, we feel we should describe our current preferences for mak- 
ing plots. If we were to choose a single plotting method to present a visualization, 
we would use contoured phase plots as in Fig. 8. 

We have found that for initial exploration, it is very effective to pair a contoured 
phase plot as in Fig.8 with a plot showing the absolute magnitude, such as a 
colormapped version of the default Sagemath plot. 

In Fig. 10, we do this for A on the disk in the cividis colormap. The strengths 
and weaknesses of these two plots complement each other. Specifically, the relative 
magnitudes are very clearly given in the (colormapped) standard domain coloring 
plot on the right and the phase information is more apparent in the contoured phase 
plot on the left. The presence of phase information in the standard domain coloring 
helps to identify the same location in the two plots. 

After this initial investigation, it is possible to choose regions and colormaps to 
highlight different behaviors of the function. 

But overall, the idea of choosing colormaps for exploratory complex function 
visual analysis is young, and there remains much to be discovered. 
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A Prym Variety with Everywhere Good Mm) 
Reduction over Q(61) ey 


Nicolas Mascot, Jeroen Sijsling, and John Voight 


Abstract We compute an equation for a modular abelian surface A that has 
everywhere good reduction over the quadratic field K = Q(V61) and that does 
not admit a principal polarization over K. 


1 Introduction 


A foundational theorem of Abrashkin [1] and Fontaine [17] asserts that there is 
no abelian variety over Q with everywhere good reduction; this theorem is an 
analogue in arithmetic geometry of Minkowski’s theorem that every number field 
has a ramified prime. More generally, the finiteness of the set of isomorphism classes 
of abelian varieties over a given number field of given dimension and with good 
reduction outside a given finite set is a key ingredient in the proof of Faltings [15] 
that a curve of genus at least two defined over a number field has finitely many 
rational points. For this reason and many others, it remains an important task to 
explicitly study abelian varieties over number fields with controlled ramification 
[3, 34]. 

Moving beyond elliptic curves, Dembélé—Kumar [12] have given examples of 
abelian surfaces with everywhere good reduction over real quadratic fields. More 
recently, Dembélé [11] gave a classification of abelian varieties with everywhere 
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good reduction over Q(./d) with d = 53,61,73: assuming the generalized 
Riemann hypothesis (GRH), he proves that every such abelian variety is isogenous 
to the power of a unique abelian surface A’). Moreover, for d = 53, 73 he was able 
to find an explicit equation for a genus 2 curve whose Jacobian is isogenous to A’,. 
In this article, we finish off the case d = 61 as follows. 


Theorem 1.1 Let K := Q(/61), let v := (1+ V61)/2, and let X be the projective 
plane curve over K defined by the equation F (x, y, z) = 0 where 


F(x, y, z) = (Sv + 17)x* — (14 + 48)x7y? + (8v + 28)x7 yz 
— (4v + 14)x?z? + (10v + 33) y4 — (120 + 44) y>z (1.2) 
+ (2v + 26)y2z? + (4v — 16)yz? + (v — 6)z4 
Then the following statements hold. 


(a) The curve X is smooth of genus 3 and has bad reduction at the prime (2) only. 

(b) The curve X admits an involution t: (x : y : Z) bh (—x : y : Z) over K, 
and the quotient of X by t defines a map from X onto an elliptic curve E of 
conductor (64). An equation for E is given by 


—(10v + 34)y? = x9 — vx? + (-v + Ix = 1. 


(c) Up to isogeny over K, we have Jac(X) ~ Ax E, where A is the Prym variety of 
the natural map Jac(X) — Jac(E), an abelian surface over K with everywhere 
good reduction. 

(d) All geometric endomorphisms of A are defined over K, and End(A) ~ V3] 
so A is of GL2-type over K. Moreover, A is isogenous to A, over K. 

(e) The abelian surface A has a polarization over K of type (1, 2), but no abelian 
surface isogenous to A over K admits a principal polarization over K. 


Although the full classification results of Dembélé are conditional on the GRH, 
the realization in Theorem 1.1 of the particular abelian surface A uses a direct 
computation, making it unconditional. 

The abelian surface AG, arises naturally as an isogeny factor over Q(V61) of a 
modular abelian fourfold Ag, associated via the Eichler-Shimura construction to the 
Hecke irreducible space S2(IT9(61), x) of classical modular forms of level 61 with 
quadratic character x, a space having nontrivial inner twist. Shimura [35, p. 210] 
conjectured and Casselman [9, Theorem 1.1] proved that A,, has everywhere 
good reduction (more generally, for good reduction properties of the Jacobians of 
modular curves, see Katz—Mazur [23, Theorem 14.7.2]). Using the Atkin—Lehner 
involution and explicit period computations, Dembélé—Kumar [12, Remark 3] 
observed that A¢, is naturally (1, 2)-polarized but not principally polarized (using a 
general criterion of Gonzalez—Guardia—Rotger [18]); consequently, this surface was 
an outlier for their method (the “sole exception” in the range of their computations). 

We construct X as follows. First, we compute the period lattice of the modular 
abelian fourfold Ag; (attached to $2(I9(61), x)) to large precision. Adapting the 
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methods of Costa—Mascot-Sijsling—Voight [10], we then numerically split Ag; as 
the square of a simple abelian surface with a (1, 2)-polarization—a building block 
in the language of Ribet and Pyle [32, 33]—providing a numerical candidate for A. 
We then glue with the elliptic curve E (in Theorem 1.1(b)) via analytic methods to 
find a curve of genus three over K, but with a terrible equation. Finally, we simplify 
the cover to obtain X — E as in Theorem 1.1 and confirm that the Prym of this 
cover has everywhere good reduction as claimed. Code that was used to obtain the 
results in this article can be found at [31]. 


Contents The paper is organized as follows. In Sect. 2, we explain explicit methods 
to compute the period matrix of a modular abelian variety to large precision. In 
Sect. 3, we illustrate our adapted methods to numerically decompose a complex 
abelian variety with a focus on our example. In Sect.4, we realize the resulting 
building block as a Prym explicitly; then in Sect. 5 we simplify the equations for the 
curve X and verify the properties announced in Theorem 1.1. 
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2 Computing the Period Matrix 


In this section, we show how to compute the period matrix of our modular abelian 
variety to large precision. We begin by recalling how this variety shows up as a factor 
in the decomposition up to isogeny of the Jacobian of the modular curve X; (61). 
After this, we explain how to determine the periods of a newform of weight 2 
in terms of its g-expansion. Finally, we show how to compute many coefficients 
of this g-expansion in reasonable time, so as to be able to compute the period 
matrix with very high accuracy. (The current implementation in MAGMA [6] allows 
computations to low precision; this is not enough for our purposes.) 


Conventions Before we begin, we set up a few conventions we use throughout. 
Let K be a field with algebraic closure K“. For an abelian variety A over K, we 
denote by End(A) the endomorphisms of A defined over K; if we wish to consider 
endomorphisms over a field extension L > K, we write A, for the base change 
of A to L and write End(A,) for endomorphisms over L. 


Setup Let k € Zso andn, N € Zyo be integers. We denote by ox(n) := Yodan dk 
the sum of the k-th powers of the positive divisors of n. In particular, o9(7) is the 
number of divisors of n. 
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Given a subgroup H < (Z/NZ)”, let 
Ty (N) := {(“ € SL2(Z) : c=0 (mod N) and a,d mod N € at. 
c 


By a newform of level [y(N), we mean a newform of level ['1(N) whose 
nebentypus character ¢: (Z/NZ)* —> C% satisfies H < kere. Let Na (My(N)) 
denote the finite set of newforms of weight k and level [y(N). This set is acted 
on (the left) by Galg, the absolute Galois group of Q, via the coefficients of q- 
expansions at the cusp oo: if f(g) = yy an(f)q” lies in Ni (Ty (N)), so that in 
particular a; = 1, then the coefficients {a,(f)}, are algebraic integers, and for t € 
Galg we have tf € Ni (Vy (N)) with g-expansion (tf)(q) = Yon T(an(f))q". 
Let Galg\Ni(T'H(N)) be the set of Galois orbits of Ny(y7(N)). For each f € 
Ni (Cy (N)), we denote by Ky := Q({an(f)}n) the number field generated by the 
coefficients of f. The field K ¢ contains the values of the nebentypus character ¢ f 
of f; furthermore the nebentypus of t(f) is €x(f) = T o €¢ forall tT € Gg. 

Let H* := HUP!(Q) denote the completed upper half-plane. Recall that for N € 
Zs and H < (Z/NZ)*, the modular curve Xq(N) := [q_(N)\H* attached 
to the congruence subgroup I'y() is defined over Q, and its Jacobian Jy(N) 
decomposes up to isogeny over Q as 


InN) ~ T] I] aaa (2.1) 


M|N fEeNo(Galg\l Ay (M)) 


where Hy < (Z/MZ)* denotes the image of H in (Z/MZ)*. Moreover, for 
each Galois orbit f € Galg\WN2('y(N)), the abelian variety A+ is simple over Q 
with dim Ar = [Ky : Q], and Ay has endomorphism algebra End(A ¢) @ Q ~ Kf. 

Let T be the Hecke algebra of weight 2 and level (NV). Roughly speaking, A ¢ 
can be thought of as the piece of Jy (N) on which T acts with the same eigenvalue 
system as f. More precisely, A ¢ is defined by 


Ap = JN)/T¢ AWN), (2.2) 
where I := {T € T : Tf = 0} is the annihilator of f under T, so that for instance 


the image of T;,|4 a End(A ¢) under the isomorphism End(A ¢) @ Q ~ Kf is the 
coefficient ay (f) € Kr. In particular, up to isogeny, the matrix 


[ OME 
¢ 


J 


whose rows are indexed by the newforms t(f) in the Galois orbit of f and whose 
columns are indexed by a Z-basis (yj); of 
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AY(XH(N), AU] = ty € Ai (Xn), Z): Ty = 0 forall T € If} 


is a period matrix of A f. 

It is therefore possible to extract a period matrix of A out of a period matrix of 
the modular curve X q (NV). While it may seem excessive to work with the periods of 
the whole Jacobian Jy(N) instead of A, this approach presents the advantage of 
making it easy to compute explicitly in the T-module Hi (X 7 (JN), Z) using modular 
symbols. 


Our Modular Jacobian Factor In what follows, we use LMFDB [26] labels 
for our classical modular forms. In our case, we observe that the Galois orbit of 
newforms attached to Ag, is 


f = 61.2.ba=q+aq’ + (a? +3)q? + O(g*) 


where a* + 8a2 + 13 = 0; the four newforms in this orbit correspond to the four 
embeddings of Q(a) ~ C, and their nebentypus is the quadratic character 761. 
Thus N = 61 and H = ker x61, the subgroup of squares of (Z/61Z)*. 

This Galois orbit 61.2.b.a happens to be the only one with quadratic nebentypus, 
whereas there are two Galois orbits with trivial nebentypus, namely 


61.2.aa=q—q? —2q°+ O(q*) 
whose coefficient field is Q, and 
_ 2 2 3 4 
61.2.ab=q + Bq? + (-B + 3)q° + O(9") 


whose coefficient field is Q(8) where 6? — B* — 38 + 1 = O. Since 61 is prime 
and since there are no cuspforms of weight 2 and level 1, the decomposition (2.1) 
informs us that Jy (61) is isogenous to the product of an elliptic curve corresponding 
to 61.2.a.a, an abelian threefold corresponding to 61.2.a.b, and our fourfold A6 
corresponding to 61.2.b.a. It also follows that the modular curve Xy(61) has 
genus 8, which is reasonable for practical computations. 

In order to compute the period lattice of Ag,, we must first isolate the rank-8 
sublattice H,(X 7 (61), Z)[/] of the rank-16 lattice H, (X47 (61), Z), where J < T is 
the annihilator of 61.2.b.a. We can obtain this sublattice as 


H\(Xp 61), ZU = (7 ker(T|4(x461),2)): 
Tel 


We observe that the Hecke operator ig + at + 13 lies in J, since Inf = 
ao(f)f = af and at + 8a? + 13 = 0; furthermore, the kernel of this operator 
acting on H\(X#(61), Z) happens to have rank 8, so that it agrees with the 
sublattice Hj (X 7 (61), Z)[/]. 
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Remark 2.3 The fact that the hunt for this sublattice was so short-lived is not a 
coincidence. Indeed, since 61 is prime and since there is no cuspform of weight 2 
and level 1, all the eigenforms of level 61 are new, so that T @ Q is a semi-simple Q- 
algebra by the multiplicity-one theorem. It is therefore étale, so that most of its 
elements generate it as a Q-algebra; in fact, an element of T @ Q is a generator if 
and only if its eigenvalues on each of the Galois conjugates of 61.2.a.a, 61.2.a.b, and 
61.2.b.a are all distinct. In particular, it is not surprising to find that T® Q = Q[T], 
so that J @ Q is generated by jie + 87s + 13, which explains why taking the kernel 
of this operator led us directly to the correct sublattice. 


Twisted Winding Elements We now recall a formula that we will use to compute 
the periods is f(z) dz of anewform f € No(T'y(N)) for y ranging over a Z-basis 
of Hi (Xy(N), Z). 

Since f is an eigenform, it is actually enough to compute these integrals for y 
ranging over a generating set of H)(X q7(N), Z) as a T-module, since 


/ (Quen / (Tf)\(@) dz = Arf) / Fede 
Ty Y Y 


for all T € T, where Ar(f) is such that Tf = Ar(f)f. 

Our strategy consists in integrating term-by-term the g-expansion of f along 
modular symbols chosen so that the resulting series has good convergence proper- 
ties. We choose to use the so-called twisted winding elements, that is to say modular 
symbols of the form 


sy = > X(-a){oo, a/m} 


a mod m 


where x is a primitive Dirichlet character whose modulus m is coprime to NV. 
Indeed, let ef be the nebentypus of f, and Ay € C% be the Fricke pseudo- 
eigenvalue of /, i.e. 


iar oe ae ee (2.4) 


n=1 


if N is squarefree, the exact value of 4 can be read off the coefficients a,(f) of f 
[30, Theorem 2.2.1]; in any case, a numerical approximation of A ¢ can be obtained 
by evaluating (2.4) at a particular point of the upper half plane. A computation [30, 
Proposition 3.2.3] based on the fact that the twist f @ x := yo an(f)x (n)q” 
of f by x is modular of level m?N and on the identity 


0 1 0 
my 

L =] "+f, 

os) love) t 


mV N 


yields the formula 
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om 1 (x@an(f) HOOGTD \ ss 
Le dem 2 am Nectopn LOBE) 


where R := e727/"VN and g(x) denotes the Gauss sum 


(2.5) 


a(x) := Do x(ayerert/™ 


aeZ/mZ 


of x (and the same with x). 


In view of the Deligne bound ay(f) <n ate it is necessary to sum numerically 
the series (2.5) up ton © my Ninel in order to compute this integral with D 
correct decimal digits. For instance, the homology of the curve X (61) introduced 
above happens to be generated as a T-module by the s, for x ranging over the 
set of primitive characters of modulus m < 7; therefore, in order to compute 
the period lattice of Ag; with 1000 digits of accuracy, we need to compute the 
coefficients a,(f) for n up to approximately 20000. We show how this can be done 
in reasonable time below. 


Remark 2.6 For safety, we actually computed the coefficients a,(f) and summed 
the series up ton < 24000. This 20% increase is quite arbitrary, but at this point 
we do not need to have a very tight control over the accuracy of our computations. 
Note that since we sum the series (2.5) using floating-point arithmetic and since its 
terms are roughly geometrically decreasing in magnitude, summing them in reverse 
(from n = 24000 to n = 1) increases numerical stability. 


High-Accuracy g-Expansions Classical algorithms, such as that based on mod- 
ular symbols or that based on trace formulas for Hecke operators, compute the 
coefficients a,(f) of anewform for n < B in complexity which is at least quadratic 
in B. Since we are aiming for B = 24000, these algorithms are not suitable for our 
purpose. We therefore turn to the method presented by Mascot [29, 3.1], which can 
compute the first B coefficients ay (f) in complexity which is quasi-linear in B. 

This method is based on the observation that any two rational functions u 
and v on an algebraic curve X are necessarily algebraically dependent, since the 
transcendence degree of the function field of X is only 1. There must therefore 
exist a polynomial ®(x, y) satisfying ®(u, v) = 0. Besides, this polynomial can 
be taken to have degree at most deg v in x and degu in y; indeed, a given value 
of u generically corresponds to deg u points of X, which in turn yield at most deg u 
values of v. 

Let f ¢ M(PxH(N)), let Ky be its coefficient field, and let Oy be the ring 
of integers of K ¢. Our method thus consists in using such a relation between the 
inverse of the j-invariant 


u:=1/j = + € qZ[Iqll (2.7) 
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where 


CO CO 
E4=1+ 240)“ o3(n)q", Eg =1- 504)“ o5(n)q", 


n=1 n=1 


and the rational function v := f/u’ € O,[[q]] where wu’ := q du/dq. 

The first B coefficients of u can be computed in quasi-linear complexity 
in B using the sieve of Eratosthenes and fast series arithmetic, and we can then 
determine the first B coefficients of v and hence of f = vu’ by Newton iteration 
using ®(u, v) = 0. 

In order to increase the efficiency of this computation, we run it modulo a large 
enough prime number p which splits completely in the coefficient field Ky of f. 
More precisely, let (@j) j<{K:Q) be a Z-basis of Oy, so that each element of O ¢ 
may be uniquely written as )* jrjoj with Aj € Z for all j. The matrix 


 T(@j) 


whose rows (resp. columns) are indexed by the complex embeddings t : K ¢ <> C 
(resp. the integers 1 < j < [Ky : QJ) is square, and invertible since the square 
of its determinant is the discriminant of K ;. Let (c;,r)j,r be the coefficients of its 
inverse, and define 


Ci max )> Icj,cl- 
J 


Then for all r > 0 and for any element 6 = bb j Ajj € O ¢, we have 
|t(@)| <r forallté: Kp > C => |Aj| < Cr forall j. (2.8) 


Since for all n € Zs, the coefficient a,(f) € Oy has absolute value at 
most o9(n)./n under every embedding t : K ¢ — C, if we require that 


p > 1+2C max ap(n)Vn, (2.9) 
n<B 


then for all n < B we can recover the exact value of a,(f) = par A j,n@j; from the 
Aj,n mod p, since we will always have p > 2|Aj.n| + 1. 

Our method is summarized in Algorithm 2.10; for more detail, see Mascot [29, 
3.1], [30, 3.3]. 


Remark 2.11 Although the complexity of this method is quasilinear in B, the initial 
computation of the a,(f) for n < b and the linear algebra required to find the 
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coefficients dm Of Pp(x, y) represent a significant overhead if the degrees of u 
and v are large. In our case, it is more efficient to proceed as follows: 


(i) Compute the first B coefficients of the newform fo = 61.2.a.a = gq — q? — 


2q* + O(q*) introduced above using Algorithm 2.10. This is faster than for f, 
since fo has trivial nebentypus, so that instead of X 7 (61), we can work on the 
curve X0(61), where the degree of u = 1/j is smaller. 


(ii) Compute the first B coefficients of f using a variant of Algorithm 2.10 based 


on a relation between the functions u = 1/j and v = (f/fo)*. Since the 
nebentypus of f has order 2, this can again be done on the curve X0(61). 


Algorithm 2.10 High-accuracy q-expansion in quasi-linear time 


Input: A newform f and B € Zs}. 
Output: The coefficients a,(f) forn < B. 
Algorithm: 


1 


lon 


. Evaluate the degrees of the rational functions u = 1/j and v = f/u’ (where u’ = 


q dq /du) on the modular curve X q (NV) in terms of the index [SL2(Z) : "y(N)] 
and of the ramification of the j-invariant map. 


. Use Riemann-Roch to find a bound b € Z;, such that any linear combination of 


the uv” with m < degv andn < degu whose first b g-expansion coefficients 
vanish is necessarily the 0 function. 


. Compute the coefficients a,(f) for n < b using one of the quadratic algorithms 


mentioned above. 


. Pick a prime p satisfying (2.9) and which splits completely in K ¢, and compute 


the first B coefficients of u mod p using fast series arithmetic in F ,[[q]]. 


. For each prime p | p of Kf, in parallel: 


a. By linear algebra, find coefficients dm, € Fy such that the first b coefficients 


of 
, ) as uy” 


m<deg v n<deg u 


are all 0 mod p, and define 


Py(x, y) = >; ~ Am nx” y" € Fp[x, y). 


m<deg v n<deg u 


b. Use Newton iteration in F,[[qg]] over the second variable of the equa- 
tion ®,(u, v) = 0 mod p to find the first B coefficients of v (mod p). 


. Use CRT over the primes p to determine the first B coefficients of v (mod pO f). 
. Express the first B coefficients of f = vu’ (mod pOr) on the Z/pZ- 


basis (@j) j<[K ;:Q] of O ¢/pO ;, and deduce their exact value thanks to Deligne’s 
bounds. 
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In general, if we wanted the coefficients of more modular forms (e.g., the 
other newform 61.2.a.b of trivial nebentypus), it is better to proceed inductively by 
replacing the j-invariant with forms that have already been computed (such as fo 
and f), since this leads to polynomials ®y(x, y) of much lower degree [30, 3.3]. 


Using this method, we can obtain the coefficients a,(f) for n < 24,000 as 
elements of K ¢ = Q(a), where a* + 8a? + 13 = 0. We then deduce the complex 
coefficients of each of the newforms t(f) in the Galois orbit of f by applying each 
of the complex embeddings of Ky to these coefficients. It then remains to evaluate 
the integrals J st (f)(z) dz using (2.5), and then to extract the periods of our abelian 
fourfold Ag), as explained at the beginning of this section. 


3 Isolating the Building Block 


The result of the algorithms in the previous sections is a 4 x 8 period matrix II € 
M,z,.s(C) with respect to some basis of the homology of the corresponding abelian 
fourfold Ag,. In this section, we find the period sublattice corresponding to an 
abelian surface as isogeny factor. 


Finding a Polarization The homology basis used for the period computations in 
the previous section need not, in general, be symplectic. Indeed, the computation 
of the intersection pairing on the homology of a modular curve is currently 
implemented only for the case of Xo(N). Therefore, we start this section by briefly 
describing how to numerically recover a polarization for IT. 


Remark 3.1 Dembélé—Kumar [12, §4.1.1, Method 1] showed that when the char- 
acter is quadratic (as in our case), the Atkin—Lehner involution splits the modular 
abelian variety, giving a polarization whose degree is a power of 2. In cases where 
the resulting polarization is not principal, they search (in the Hecke algebra) for a 
suitable principal polarization, so the method we give can be seen as a more general 
version of this approach. 


More generally, let IT € Mg,2¢(C) be a period matrix, with corresponding abelian 
variety A = V/A, where V = C8 and A = 1Z?8. Having TI at our disposal yields 
an identification of the homology group H\(A, Z) with Z?8. Multiplication by i 
on A yields an endomorphism J of H(A, R), which we similarly identify with the 
unique matrix J € Mo,(IR) obtained from the equality [2, §6.1] 


il = Ty, (3.2) 


Via the above identification and the Riemann relations [5, 4.2.1], finding a polar- 
ization for A comes down to finding an alternating matrix E € Mo,(Z) sat- 
isfying J'EJ = E (where ‘ denotes transpose) and the positive definiteness 
condition i1E~!I1* > 0 (where * denotes the conjugate transpose). 
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We first focus on the linear equations 


E'=-E 
(3.3) 
JEJ=E. 


Let m = g(2g—1). We choose a Z-basis of size m of alternating matrices Mo. (Z)att 
(e.g., indexed by entries above the diagonal). Numerically solving the equations 
(3.3) then reduces to finding common approximate zeros of a number of linear 
forms £),...,€, : Z’" —> R. We determine a Z-basis of vectors v for which 
all £;(v) are numerically negligible by LLL-reducing the lattice spanned by the rows 
of the matrix 


In Celi srt CLin . (3.4) 


Here J,, is the m-by-m identity matrix, and the other columns correspond to the 
linear forms £; multiplied by a sufficiently large scalar C € Ryo. The first elements 
of the resulting LLL-reduced basis will give approximate solutions of (3.3), and we 
preserve those that do so to large enough precision to get a Z-basis of numerical 
solutions for (3.3). 

Having found a Z-basis of solutions of (3.3), we take small linear combi- 
nations until we find an element E of determinant 1 such that the Hermitian 
matrix il E~'I1* is positive definite. We summarize our steps in Algorithm 3.5. 


Algorithm 3.5 Numerically recovering principal polarizations 


Input: A period matrix I] € Mg,2¢(C), not necessarily with respect to a symplectic 
basis. 

Output: An alternating matrix E € M2, (Z) defining a principal polarization for the 
abelian variety corresponding to IT. 

Algorithm: 


1. Use LLL to find a Z-basis {E),..., Eg} of solutions of (3.3) in Mo9(Z)att. 
2. Take a (random) Z-linear combination E = ae c, E;, with c; € Z. 
3. If det(E) = 1 and iM E~'TIi* is positive definite, return E; else, return to Step 2. 


In this way, we quickly recover a principal polarization E on I. 


Remark 3.6 More generally, this method can be used to search for polarizations of 
any desired type. 


For our modular abelian fourfold A = A6;, whose period matrix TT € Ma.s(C) 
was computed in the previous section, we carry out Algorithm 3.5 and recover 
(numerically) a principal polarization on A. 
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Finding Small Idempotents With a polarization in hand, we numerically decom- 
pose the abelian variety A = Ag, following Costa—Mascot-—Sijsling—Voight [10]: 
we compute numerically that End(Ac) is an order of index 13 in M2 (Z[V3]). Using 
methods similar to those above, we find a Z-basis {(7), Ri), ..., (Tg, Rg)} for the 
pairs (7, R) with T € Ma(C) and R € Mg(Z) such that numerically 


TH =MR. (3.7) 


Specifically, Rj = 1, and the remaining matrices Rj are 


-10 0 1 1 0 0 0 2°0 =1 0 “0 <2 =1=2 000 1 1 0 0 0 
-1-1-10 1-12 0 100 1 1 0 1 2 000 0 2 0 2 0 
1 0 0 0-11 0 2 —-1-1-1-2-10 0 0 000 1 0 0 1 2 
1 0-1 0 -1-1-1 0 0-1-2 0 0 0 0 0 000 1 0 0 1 2 
101 01 1 1 «0 Lt. dt f=<12.1 2 00 0 -2-1 0 -1 -2 
1 0 -1 -1-1 -2 -1 -2 1-1-1 0 -1 0 -1 0 -10-10 -1-10 0 
10-1 0 -1-1-1 0 -1-10 1 1-20 0 000 2 0 0 0 2 
101 01 1 1 «0 12 2 01 14140 000-2 0 0 -1 -3 

10 0-1 0 —2-1-2 10 00 00 0 0 

0 0 -1-2-1 1-2-1 est SP S12: b =) 

0-1-1 1 001 0 0-11 000 1 

-1-10 00 0 1 0 0 0-1-10 0 1 

tt 0-=1 =1 0 =-1-+1 2.02 2 22-11 

-10 01 01 1 2 0 0 0 0 1-11 «0 

-1-10 100 0 1 2.0) 2:52: =1 =2:=1.=3 

1 1 0-1 0 0 -1-2 1 01 1 0 1-10 

1 0 1-10 0 0 0 100 0 0 0 0 0 

2 1 12-2 1 1-2-3 1 £ 0 —2 0 —-2-2-3 

3 -—1 —2 -1 -—1 —2 -—2 -3 1112121 21 «21 ~«21 

2-1 0-1-12 0 1 0 1 1 °0 1-1 0 -1 

-l 1-11-1011 —3-1-1 1-13 1 3 

-1-1-10-21 02 0 0 1 2 1 1 2 2 

1-12 1 0 0-11 2-11 4141-31-41 

-12 0 0 2-11 -2 10 -—2-2-2 2 -—2 0 


The matrices R; are a Z-basis for the numerical endomorphism ring End(Ac) C 
Mg(Z); we look for small idempotents therein as follows. Inspired by algorithms 
for computing an isomorphism with the matrix ring [16, 22], we equip Mg(Z) with 
the Frobenius norm (sum of squares of entries) VN: Mg(Z) — Z, a positive definite 
quadratic form; then idempotents (and, more generally, zerodivisors) are likely to 
be short vectors with respect to this norm. This very quickly identifies scores of 
idempotents 


Ro +1, Ra +1, Ro-—R34+1,Rsg,... (3.8) 


and we select one that gives us the simplest looking decomposition. 


Inducing the Polarization Let (T, R) be a small idempotent in End(Ac) ® Q as 
constructed above. We can then find an isogeny factor of A in two ways: 
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(i) Taking the kernel of T yields a linear subspace of W of C8 in which WN TZ78 
is a lattice. Choosing bases, we get a new period matrix IT’ as well as a pair 
abusively still denoted (JT, R) such that TT’ = TR. The matrix T represents 
the inclusion of W = C#™™) into V = C8. 

(ii) Alternatively, taking the image of T yields a linear subspace of W of C% in 
which TI1Z78 is a lattice. Choosing bases, we get a new period matrix I’ as 
well as a pair abusively still denoted (T, R) such that TT = I’ R. The matrix T 
represents the projection from V = C8 to W = CMM), 


Note that by the dimensions of the abelian varieties W in approaches (i) and (ii) 
above are complementary in the sense that they sum to g. 


Proposition 3.9 In approach (i), the restriction of E to W is represented by the 
matrix E' := R' ER, and E' defines a polarization for TY’. 


Proof It is clear that E’ is alternating. The complex structure J restricts to give the 
complex structure on W, in the sense that because TIT’ = IIR we have 


ORI’ =TM' J’ =iTT’ =iNnR=TJR, (3.10) 


so that RJ’ = JR. Therefore E’, as the restriction of E under R, is again compatible 
with J’, or more precisely 


JES =J "RERJ' =RI'EIR=RER=E’'. (3.11) 


Similarly, since E’ is the restriction of E under R, the corresponding Hermitian 
form on W remains positive definite, which is equivalent to iI’ E’~'N1’* > 0 (as in 
Birkenhake—Lange [5, Proof of Lemma 4.2.3]). oO 


Proposition 3.12 In approach (ii), an integer multiple of E’ := (RE7!R')~! 
induces a polarization on I’. 


Proof This follows from the previous proposition by double duality. The projec- 
tion (T, R) induces a dual map (TY, RY) : (A‘)Y — AY on the corresponding 
dual abelian varieties. Recall that we can identify AY = VY/AY, where VY := 
Home(V, C) is the set of antilinear maps from V to C and where A” is the dual 
of A under the canonical pairing between V and VY. The original pairing E gives 
rise to a map V — VY sending the lattice A into its antilinear dual A*. In other 
words, we have 


E(A@Q) =A“ OQ. (3.13) 


Therefore a suitable multiple of the inverse map E~'! : VY — V maps AY into 
its dual A. This defines a polarization on AY; it is compatible with the complex 
structure because F is, and it is positive definite for the same reason. We can transfer 
this polarization to A’Y by the previous proposition, using the inclusion (TY, RY) 
dual to the projection (JT, R). Note that RY = R' by duality. Dualizing a second 
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time, we get our induced polarization on A’ after again taking a suitable integral 
multiple if needed. Oo 


Remark 3.14 If the morphism (7, R) in the previous proposition corresponds to a 
map of curves of degree d under the Torelli map, then E’ is in fact d times the 
principal polarization on IT’ by Birkenhake—Lange [5, Lemma 12.3.1]. However, 
the above notion of an induced polarization applies whether (7, R) comes from a 
map of curves or not. 


We apply approach (ii) to the chosen polarization E and idempotent (7, R) 
for our period matrix IT € Mg4.3(C) and obtain a matrix Tz € Mo4(C) with a 
polarization E2 of type (1, 2), which defines an abelian surface Az. Note that the 
above approaches are essentially equivalent by duality; we slightly preferred the 
second as it comes with a projection map A —> Ao. 


4 Computing an Equation for the Cover 


The (1, 2)-polarized abelian surface Az defined by I1z and E> in the previous 
section was obtained from a numerical idempotent of II defined over the base 
field K = Q(v/61). Therefore the abelian surface Az is also defined over K. 
However, the polarization E2 is not principal, so we do not obtain in this way a 
Jacobian of a genus 2 curve over K. In this section, we show how to realize Az as 
the Prym variety associated to a map X — Aj, from a genus 3 curve to a genus | 
curve. 

Along the way, we will continue our numerical computations and retain our sen- 
sitivity to fields of definition; at the end, we will rigorously certify our construction. 

To obtain a principal polarization, one can first try to take a maximal isotropic 
subgroup of the kernel of the isogeny A — AY” defined by E>. This kernel 
is isomorphic to Z/2Z x Z/2Z and carries a non-trivial Weil pairing; there 
are therefore 3 such maximal isotropic groups. Calculating the corresponding 
principally polarized quotients of Az, one obtains that all of them give rise to 
indecomposable polarizations and therefore correspond to algebraic curves under 
Torelli. However, calculating the corresponding Igusa invariants, or alternatively 
reconstructing after Guardia [19] (implemented at [36]) shows that all of them 
define the same cubic Galois extension L of K, namely that defined by the 
polynomial x7 — /61x? + 4x + 2. 

This cubic polynomial also shows the way out. We will glue A2 with an elliptic 
curve A, over K to realize A2 as the Prym of genus-3 double cover of A; over K. 
This is a slight generalization of the same gluing operation when both A2 and A, 
have a principal polarization, which will be considered in detail in the upcoming 
work by Hanselman [20] (for code, see Hanselman-Sijsling [21]). 

Via the aforementioned cubic polynomial, we consider the elliptic curve 


Ay =e aS Bix? 2452. (4.1) 
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Let IT2 be a period matrix for the 2-dimensional abelian variety A2 with respect to 
which the polarization E2 is represented by the alternating form 


— 


0 
E2 = 0 (4.2) 


0 
0 
0 


The elliptic curve A; will have a principal polarization represented by the standard 


symplectic matrix Ey = (° a \ and with corresponding period matrix I]; say. 


Consider the period matrix 


TI2 0 
I, = 4. 
3 ( 0 *) , (4.3) 


with corresponding abelian variety A‘; it admits a polarization represented by 


(4.4) 


i) 


Let G := ker(E3) < ANIA, where A’ = Hj(A3,Z) = 11,28. Then G is 
equipped with the basis corresponding to the final four columns of £4, and under 
the corresponding identification 


G ~ (Z/2Z)* x (Z/2Z)" (4.5) 


the Weil pairing on G is the product pairing. By Birkenhake—Lange [5, 2.4.4], the 
quotients of A on which the polarization E4 induces a principal polarization are 
in bijection with the maximal isotropic subgroups H < G—there are 15 such 
subgroups. 


Proposition 4.6 Suppose that A’, and G are defined over K. Then there exists 
a maximal isotropic subgroup H < G that is defined over K. In particular, the 
quotient A’,/H is defined over K. 


Proof The first factor in (4.5) corresponds to the kernel of the polarization E2 
on A». By the discussion at the beginning of this section, we know the structure 
of this group as a Galois module: the elements of order 2 are cyclically permuted by 
a generator of the Galois extension L of K defined by x? — /61x? + 4x + 2. This 
is the very same Galois module structure as that on the second factor in (4.5), which 
corresponds to A,[2]. 
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We can therefore obtain a Galois-stable isotropic subgroup H < G by starting 
with a pair (x1, x2) with x; A O and x2 ¥ O and taking the image under Galois, after 
which we adjoin 0: 


H := {(0, 0), (x1, x2), (6 (1), 6 (22)), 
(07 (x1), 07(x2)) = (41 +o (x1), x2 +o (x2))}- (4.7) 


Since o;(x;) and o;(x;) pair to 1 fori # j and similarly for x2, this subgroup is 
indeed isotropic, because the product pairing on G assumes the values 0 + 0 = 0 
and 1 + 1 =0 (mod 2). By construction, H is defined over K. 

The second part of the proposition follows immediately. oO 


Given a maximal isotropic subgroup H of G, say with generators h; and h2, we 
take inverse images h, and 2 in 5A! . This yields a lattice 


A=A'+Zh, + Zho. (4.8) 


that contains A’. Choosing a basis of A, and keeping the basis for A’ used 
heretofore, we find a matrix R3 € M6(Z) such that AR3 = A’. By construction, E; 
induces a principal polarization on the abelian variety corresponding to the period 
matrix 


113 = 115R;. (4.9) 


Note that we have a projection map corresponding to the pair (/3, R3) in the 
equality /3I1, = I13R3. Under this map, the induced principal polarization on 113 
corresponds to the polarization class from Proposition 3.12. 

While we can in principle identify the subgroup H in terms of the columns 
of E3 by using the Abel-Jacobi map from [27], we just tried all 15 subgroups, 
as in Algorithm 4.10. 

Invoking this calculation using 1000 decimal digits, only one isotropic subgroup 
numerically gave rise to invariants over K; we obtain in this way a period 
matrix I13 whose abelian variety Az := C?/I13Z° is equipped with the principal 
polarization E3. 

Finally, using the main result in Lercier—Ritzenthaler—Sijsling [25] (recognizing 
invariants), we constructed a plane quartic equation defining a genus 3 curve whose 
Jacobian (numerically) matches the abelian surface A3. 


5 Simplifying, Twisting, and Verifying the Cover 


The methods from the previous section furnish us with an equation F(x, y, z) = 0 
defining a smooth plane quartic curve X C P? over K whose Jacobian putatively 


A Prym Variety with Everywhere Good Reduction over Q(61) 575 


Algorithm 4.10 Finding a principally polarized quotient over K that is a Jacobian 
Input: The period matrices I12 and I]; for the factors Az and A, over K, with 
polarizations E> and FE, of type (1, 2) and (1) respectively, along with the associated 
matrices IT; and E% as in (4.3) and (4.4). 

Output: A matrix Tz; = T,R3 on which E’, induces a principal polarization E3 
such that I13 numerically corresponds to a genus-3 curve X over K, along with £3 
and the invariants of X. 

Algorithm: 


1. Determine the 15 isotropic subgroups H of G = ker(E3) = (Z/2Z)* x (Z/2Z)* 
with the product Weil pairing. 

2. For a given H as in Step 1, lift generators h; and hz to elements hy and hy 
of 5A/A'. 

3. Construct A = A’ + Zhy + Zh, choose a basis of A, and find R3 € M6(Z) such 
that AR3 = A’. 

4. Let Il3 = 11,R3', and let E3 = R;‘E,R;'. 

5. Using the methods of [24, §2] and LLL, check if the principally polarized abelian 
variety defined by I13 and £3 corresponds to an algebraic curve X, and if so, 
whether the invariants of X are numerically in K. If so, return I13, £3, and the 
corresponding invariants. Otherwise proceed to the next H in Step 1. 


contains the abelian surface A = Ap and the elliptic curve E = A, in Theorem 1.1 
as isogeny factors over Q*!. Since this reconstruction from invariants is only up 
to Q*-isomorphism, this Jacobian may not yet contain the given factors over K. 
Moreover, the implementation of these methods does not yet optimize the equations 
involved when working over proper extensions of Q; indeed, the first homogeneous 
defining polynomial F is of a rather terrible form, whose coefficients are thousands 
upon thousands of decimal digits long. In this section, we simplify, twist, and then 
verify that the cover is as claimed. 


Simplification Since the ring of integers of K has class number 1, a first step is 
to remove superfluous primes from the discriminant by generalizing methods of 
Elsenhans [14]. This approach makes the equation somewhat smaller, but its size is 
still thousands of digits. 

Although we constructed X from a period matrix I, the reconstruction methods 
involved only uses invariants and does not furnish the period matrix of Jac(X) 
in terms of ITI. Fortunately, using integration algorithms due to Neurohr [28], 
we can simply recompute the period matrix of Jac(X) from scratch. Then, with 
the techniques of Bruin—Sijsling—Zotine [7] one can find the numerical geometric 
automorphism group Aut(Xc) of the curve X along with its tangent representation 
on the ambient P”. It turns out that Aut(Xc) ~ Z/2Z, and in particular the nontrivial 
involution ¢ generating Aut(Xc) is putatively defined over K itself. 

Let T € GL3(K) be a matrix representing 1, ie, F - T = AF under the 
substitution acting on the right. Since X is canonically embedded, we can read off 
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the matrix T from its tangent representation. Moreover, we can write T = UDU™!, 
where D is a diagonal matrix with diagonal entries (—j, jz, “) say. Then for H = 
F-U wehave H-D = F-UD=2F-U = AH, so that H(—x, y, z) =AH (x, y, Z). 
In this way, we recognize an exact automorphism and reduce to the case where 


Fa,y,z= G(x’, y, 2), (5.1) 


with G homogeneous in x, y, z of weights 2, 1, 1. Now the equation G(x, y, z) = 0 
defines a genus-1 curve E, which is the quotient of X by its involution ¢ and which 
is still defined over K. The map X — E is ramified over the divisor of zeros D of x 
on E. 

We then calculate the j-invariant of the binary quartic qg(y, z) corresponding 
to the genus-1 curve E. An elliptic curve Eg with the same j-invariant over K is 
defined by 


Eo: uw? = po(t) 62} 
over K, where 
po(t) = (10v + 34)? + vt? + (1 — v)t +1) (5.3) 


where v := (1 +4/61)/2 € K satisfies »? = v +15. The conductor of Eo is (64). As 
luck would have it, the binary quartic g is equivalent over K to the quartic z4 po(y/z) 
corresponding to the polynomial po, yielding an isomorphism FE Ss akege Pushing 
the divisor D along this isomorphism, one obtains a divisor Do on Eo. Again luck is 
on our side and the divisor Do — 400 on Ep is principal; the corresponding degree-2 
cover Xo of Eo is defined by the affine equations 


(10v + 34)u? = + 007+ —v)t+1 


0: (5.4) 

(—2v + 9)w* = 2u + (—7v — 24)t? + (404+ 14)t — 2-7 
Eliminating u using the second equation and substituting in the first, replacing w <— 
x andt < y, and homogenizing yields the defining homogeneous quartic 


Fo(x, y, 2) = (—195v + 859)x4 + (—64v + 282)x7y? + (—5v + 24)y4 
+ (148v — 652)x?yz + (—16v + 68) y?z + (—74v + 326)x727 


+ (96v — 422) y?z7 + (—148v + 652) yz? + (—47v + 207)z*. 
(5.5) 


We are blessed for one final time because Fo defines a non-singular curve Xo C 
P* with the same Dixmier—Ohno invariants as the original terrible curve X. (This 
is not automatic, as a degree 2 cover of a genus-1 curve is not determined by its 
ramification locus only, in contrast to the case of genus-0 curves.) In any event, we 
have finally obtained a simplified equation Xo. 
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Twisting It remains to find the correct twist of Xo (with respect to the involution 
on Xo) that matches A» as an isogeny factor over K (instead of merely over Q?). 
Again, this step is needed because the algorithms in Lercier—Ritzenthaler—Sijsling 
[25] only yield the correct geometric isomorphism class. We proceed as follows. 

First, we considering the scheme over Z cut out by (5.5) and the equation for v. 
By a Grébner basis computation, we find that Xo has bad reduction only at the 
prime (2); so our desired twist will be by a 2-unit (in K *). We begin by computing 
generators for the group U := Zx[1/2]*/Zx[1/2]** of 2-units of K modulo 
squares, a group with 2-rank 3. 

To find the right twist, we loop over primes p > 5 that split in K and by counting 
points compare the L-polynomial L,(X,T) € 1+ TZ[T] of X and the absolute L- 
polynomial of f 


Lp(f,T) := (1 — pT + x(p)pT*)(l — t(ap)T + x(p)T*) € 1+ TZ(T], 


where (t) = Gal(K | Q). We find always that Lp)(f, £T) | Lp»(X, T) for a unique 
choice of sign €p. 


6 
We then look for an element 6 € U such that (5) = €p for each prime p 


above p: we find a unique match, namely 6 = —5v + 22, an element of norm —1. 
Twisting by this element (replacing x7 <— 6~!x*) gives an equation all of whose 
good Euler factors match: 


F(x, y, z) = (Sv + 17)x* — (14 + 48)x7y? + (8v + 28)x7yz 
— (4v + 14)x7z7 + (10v + 33) y* — (12v + 44) y2z (5.6) 
+ (2v + 26)y2z? + (4v — 16) yz? + (v — 6)z4. 
Verification We now conclude with the proof of our main result (Theorem 1.1); for 


convenience, we reproduce the statement. 


Theorem 5.7 Let X be the projective plane curve over K defined by the equa- 
tion F(x, y, Zz) = Oas in (5.6). Then the following statements hold. 


(a) The curve X is smooth of genus 3 and has bad reduction at the prime (2) only. 

(b) The curve X admits an involution t: (x : y : Z) bh (—x : y : Zz) over K, 
and the quotient of X by t defines a map from X onto an elliptic curve E of 
conductor (64). An equation for E is given by 


—(10v + 34)y? = x9 — vx? + (-v + Ix = 1. 
(c) Up to isogeny over K, we have Jac(X) ~ Ax E, where A is the Prym variety of 


the natural map Jac(X) — Jac(E), an abelian surface over K with everywhere 
good reduction. 
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(d) All geometric endomorphisms of A are defined over K, and End(A) ~ ZV3] 
so A is of GL2-type over K. Moreover, A is isogenous to Aj, over K. 

(e) The abelian surface A has a polarization over K of type (1, 2), but no abelian 
surface isogenous to A over K admits a principal polarization over K. 


Proof Part (a) was verified directly via a Grébner basis computation as above: 
more precisely, we compute that the ideal N generated by F and its derivatives 
in Z[v][x, y, z], saturated at the irrelevant ideal (x, y, z), has NN Z[v] = Qj) 

Part (b) is because F(x, y, z) = G(x’, y, z), and G(x, y, z) defines an elliptic 
curve, equipped with the point at infinity (v — 3 : 1 : 0); we then compute the given 
equation (arising up to quadratic twist by construction). For the first part of (c), the 
splitting over K comes from the map in (b). 

Next, we prove part (d) using the methods of Costa—Mascot-Sijsling—Voight 
[10], which rigorously certifies that End(Jac(X)) = End(Jac(X*')) ~Zx ZV3], 
where X*! denotes the base change to an algebraic closure K*! of K. More precisely, 
with respect to a K-basis of differentials, we certify that there is an endomorphism 
with tangent representation 


1 0 0 
6 30 G=a (5.8) 
G32 =<372 


representing multiplication by /3. 

To finish the proof of (c), frustratingly it is not clear how to algorithmically verify 
that the Prym has everywhere good reduction directly. (We could not find an imple- 
mentation for the computation of regular models over number fields.) Instead, we 
look at Galois representations and apply the now standard method of Faltings—Serre, 
the relevant details of which we now sketch: see Dieulefait-Guerberoff—Pacetti [13, 
§4] and more generally Brumer—Pacetti—Poor—Tornaria—Voight—Yuen [4, $2]. 

The 2-adic Tate module of the abelian surface factor yields a Galois representa- 
tion 


paz: Galk > GLp(Za[V3}), (5.9) 


where Galx := Gal(K# | K) is the absolute Galois group of K; since (2) = (2 
ramifies in Z[V3], reducing modulo [ = (1 + V3) gives the mod [ representation 


PAL: Galx — GL2(F2). 


Counting points, we find that the conjugacy classes of Frobenius elements for either 
prime above 5 has order 3, so the image of p , ; has order 3 or 6, and correspondingly 
the fixed field of ker p> is an extension of K of degree 3 or 6 ramified only at (2). By 
computational class field theory, we verify there is in fact a unique such extension: 
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it is the normal closure of the extension of K defined by the polynomial po(f) in 
(5.3), whose absolute field is defined by 


x© 4x5 — 7x4 4 14x? — 16x + 4: (5.10) 
the full normal extension L of Q is defined by 


x? — 4x!" 4 10x! — 44x? + 17428 — 422x7 + 618x° — 512x° 
+170x4 + 56x? — 58x? + 6x + 9. (5.11) 


We repeat the same calculation for the newform f € S2(Io(61), x): its base 
change fx to K has level (1) and so gives a Galois representation ps, ,: Galx > 
GL2(Z2)) for which the characteristic polynomial of Frobenius lies in Za[ V3]. 
Reducing modulo [ and semisimplifying gives Pret: Galk > GL2(F4); since 
the traces belong to F, up to equivalence the representation takes values in Fz 
[37, Corollary 1], and as in the previous paragraph we confirm that it is surjective 
so absolutely irreducible so equal to its semisimplification: indeed, we must 
have (4; ~ Pry, by uniqueness. By a result of Carayol [8, Théoréme 2], up to 
equivalence the representation takes values in Za[V3], ie., we have pry (: Galx > 
GL2(Z2[V3]). 

We conclude by showing that p4,; ~ /¢ f,,1, from which the remainder of part (c) 
follows as fx has level (1). We compute Z,[1/2]* /Z,[1/2]** ~ (Z/2Z)? and that 
the primes of L above 3, 5, 61, 97 generate the corresponding elementary 2-abelian 
extension of L. The method of Faltings—Serre then provides that o4,, ~ pfy,1 if 
and only if we have the equality of traces trp, {(Frobp) = tr pr, ,{(Frobp) for the 
primes p of K above 3,5, 61,97, where Froby € Galx denotes (a representative 
of) the conjugacy class of the p-Frobenius automorphism. Having checked equality 
of L-polynomials for these primes (indeed, we checked all primes of norm up 
to 200), the result follows. 

Finally, having proven in the previous paragraph that A is modular, part (e) was 
proven for a particular abelian surface in the isogeny class of A by Dembélé—Kumar 
[12, Remark 3] (using a criterion of Gonzélez—Guardia—Rotger [18]). We now show 
it extends to all abelian surfaces in the isogeny class. We have exhibited a (1, 2)- 
polarization 4: A — AY, where AY denotes the dual abelian variety. Assume for 
purposes of contradiction that there exists an isogeny g: A’ > A and a principal 
polarization 4’ on A’, both over K. Then 


i= @Y 0od0g: A’ > (AY 


is another polarization, with deg(u’) = 2deg(y)?. From work of Gonzaélez— 
Guardia—Rotger [18, Theorem 2.3, Theorem 2.10(ii)], we conclude that ju’ corre- 
sponds to a totally positive element of End(A’) of norm deg(,z’). But End(A’) is an 
order in End(A’) @ Q = Q(73), and 2 is not a norm from this field—equivalently, 
the conic x? — 3y? = 2z* has no rational points over Q, or also equivalently, 
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the corresponding quaternion algebra (2,3|Q) 2% Mbo(Q) is not split, having 
discriminant 6—a contradiction. (The element 1 + 3 € Z[V3] has norm —2, 
and (2) represents the nontrivial element of the narrow class group of Z[V3].) oO 
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The S-Integral Points on the Projective ®) 
Line Minus Three Points via Finite Cheek for 
Covers and Skolem’s Method 


Bjorn Poonen 


Abstract We describe a p-adic proof of the finiteness of (P! — {0, 1, 00})(Z[S~!]) 
using only Skolem’s method applied to finite covers. 


2020 Mathematics Subject Classification Primary 11G30 


1 Introduction 


Let S be a finite set of primes of Q. Let R = Z[S~!]. Let X = Pe — {0, 1, co} = 
Spec R[x, x~!, (1 — x)~!]. The set X(R) is in bijection with {(x, y) € R* x R™: 
x+y=l]}. 

Siegel [Sie26] proved that X (R) is finite. Kim [Kim05] gave a new, p-adic proof 
of this fact, as an application of his nonabelian analogue of the Skolem—Chabauty 
method. Inspired by Kim’s proof, we give a different p-adic proof, using only 
Skolem’s method applied to finite covers. (In fact, the proof we present dates from 
an April 23, 2005 email to Kim following a talk he gave on his method, but we have 
not published our proof before now.) 


2 Review of the Skolem—Chabauty Method 


Let k be a finite extension of Q. Let S be a finite set of places of k containing all the 
archimedean places. The ring of S-integers in k is 
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R:= {x €k: v(x) > Oforallv € S}. 


Skolem devised a method that, in modern terms, for some subvarieties X in 
an algebraic torus over R, could prove finiteness of X(R) or even determine it 
explicitly [Sko34]. His method was generalized by Chabauty [Cha38], who also 
adapted Skolem’s method to study rational points on a curve X in an abelian variety 
[Cha41]; see [MP12] for an introduction to the latter. Although we need only the 
torus case, it is not much extra work to describe the method in a more general setting, 
so we will do so. 

A semiabelian variety J is a commutative group variety fitting in an exact 
sequence | > T — J — A — 1 with T a torus and A an abelian variety. 


Proposition 2.1 Let R be a ring of S-integers in a number field k. Let J be a finite- 
type group scheme over R whose generic fiber Jx is a semiabelian variety. Then the 
abelian group J(R) is finitely generated. 


Proof By replacing k by a finite extension and enlarging S, we may assume that J 
fits in an exact sequence of R-group schemes 


1— G- J AS 1 


for some n > 0 and abelian scheme A over R. Taking R-points yields an exact 
sequence 


1 —> (R*)" — J(R) — A(R). (1) 


The group R% is finitely generated by the Dirichlet S-unit theorem. By the valuative 
criterion for properness, A(R) = A(K), which is finitely generated by the Mordell— 
Weil theorem. Now (1) shows that J(R) is finitely generated. oO 


For a group variety J over a field, say that a subvariety X C J generates J if, for 
some n, the addition morphism X” — J is surjective (which amounts to requiring 
that it gives a surjective map on points over an algebraically closed field). 


Proposition 2.2 Let J be a semiabelian variety over Qy. Equip J(Qp) with the 
p-adic topology. Let T be a finitely generated subgroup of a compact subgroup 
G < J(Q,). Let X be an irreducible curve over Qp. Lett: X > J be a morphism 
whose image generates J. IfrankT < dim J, then {x € X(Qp) : U(x) € TP} is finite. 


Proof (The details are analogous to those in [MP12, §4].) We may assume that z is 
proper. Since J(Q,) has a basis of neighborhoods of 1 consisting of compact open 
subgroups K, we may replace G by G+ K for any such K to assume that G is open 
in J(Q,). By [Bou98, III.§7.6], there is a canonical homomorphism 


log: G > LieG = Lie J. 
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The group logI’ is generated by at most rkT’ elements, and rk. < dimJ = 
dim(Lie J), so there is a linear functional 4: LieJ — Q, that vanishes on 
logI and even on its closure logl’. Pulling 4 back to the compact subset 
{x € X(Qp) : ex) € G} yields an analytic function 7 that is locally the integral of 
a nonzero |-form on X, so the zero locus of 7) is discrete, and hence finite. Finally, 
{x € X(Q,) : 4x) € I} is contained in the zero locus of n, by definition ofA. oO 


Theorem 2.3 (The Skolem—Chabauty Method) Let R be a ring of S-integers in 
a number field k. Let X be a finite-type separated R-scheme such that Xx is an 
irreducible curve. Let J be a finite-type separated R-group scheme such that Jy is 
a semiabelian variety. Let i: X; — J be a morphism whose image generates Jy. If 
tk J(R) < dim Jy, then X(R) is finite. 


Proof Let Ry C ky denote the completions of R C k at v. Let R= II e¢s Rv. LetA 
be the restricted product lug s(kv, Ry), so the R-algebra Ris open in the k-algebra 
A, and Rk = R. Since X(R) is compact and J(R) is open in JS(A), the map 
: X(A) — J(A) maps X (R) into a finite union of cosets of J (R). Intersecting 
with J(k) shows that 1 maps X (R) into a finite union of cosets of J(R) in J(k). 
Choose p ¢ S that is unramified of degree 1 over a prime p of Q. Then Rp ~ Zp 
and ky ~ Qp. Proposition 2.2 applied to 1g, with P = J(R) (finitely generated by 
Proposition 2.1) and G = J(Zp,) shows that {x € X(R) : ex) € J(R)} is finite. 
The same argument with 1 composed with a translation shows that {x € X(R) : 
u(x) € j + J(R)} is finite for each j € J(Q). By the first paragraph, X(R) is 
contained in a finite union of these. oO 


3 Proof of Siegel’s Theorem 


Let R = Z[S~'] and X = P} — {0, 1, oo}. Let ¢ be a prime. Let A be a (finite) set 
of representatives for R* /R**. For eacha € A, let tq: Y, —> X be the finite cover 
obtained as the inverse image of X under the morphism 


PL— PL 


yrKo ay’. 


Any element of X(R) C Gm(R) = R% is ay’ for some a € A and y € R* C 
P!(R), and then y € Y,(R) by definition of Y,. Thus X(R) = el er Ia (Yq(R)). It 
remains to prove that each set Y,(R) is finite. 

Let O = Rit I/(at® —1)and K = Q{t t]/(at® — 1). Ifa represents the trivial class 
in R*/R**, then K ~ Q x Ko for a number field Ko; otherwise define Ko := K, 
which is already a number field. Let Oo be the integral closure of R in Ko. 

We have Y, = Pe — {0, oo, zeros of ay® — 1}. Over Q, the generalized Jacobian 
of Yq (i.e., of P! with modulus consisting of the £ + 2 removed points) is a torus 
of dimension € + 1, and it has a natural model over R, namely J := Gmn.pr xX 
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Reso/r Gn,o, where Res denotes restriction of scalars. The usual morphism from 
(Yq)q to its generalized Jacobian Jg, up to translation in Jg, is y+ (y, y—f), and 
its image generates Jag. Skolem’s method (Theorem 2.3) applies if we can prove that 
rkJ(R) < €4+1. 

We will show that rk J/(R) < £+ 1 holds when @ is large. Below, O(1) denotes 
a quantity whose size depends only on X and S, not on ¢ or a. We have 


tk J(R) = rk R% + rk O% < 2rk RX + rk OX = O(1) + 1k OX. (2) 


Since Ko is of degree £ + O(1) and has at most one real place, the Dirichlet S-unit 
theorem implies that 


tkO>j = ¢/2 + #{primes in Ko above S} + O(1). (3) 


Let s = #S. For fixed p € S, the number of primes of Ko above p having degree 
< 3s is at most p>’, because these primes correspond to distinct irreducible factors 
of ax’ — 1 mod p of degree < 3s; on the other hand, there are at most £/(3s) 
primes of Ko above p having degree > 3s, because their degrees sum to at most 
[Ko : Q] < é. Thus 


#{primes in Ko above S} = > (p* + ¢/@s)) = €/34+ O(1). (4) 
pes 


Substituting (4) into (3) and then (3) into (2) yields 
tk J(R) < €/24+-£/3+ O11) < £41 


if @ is sufficiently large in terms of S. 


Remark 3.1 The proof does not seem to generalize readily to prove the analogue for 
rings of S-integers in number fields other than Q. The limitations of this approach 
are investigated thoroughly in [Tril9]. 


Acknowledgments I thank the referees for encouraging me to include an exposition of the 
Skolem—Chabauty method. 
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